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Abstract. We define two transformations from term rewriting systems (TRSs) to context- 
sensitive TRSs in such a way that termination of the target system implies outermost 
termination of the original system. In the transformation based on 'context extension', 
each outermost rewrite step is modeled by exactly one step in the transformed system. 
This transformation turns out to be complete for the class of left-linear TRSs. The second 
transformation is called 'dynamic labeling' and results in smaller sized context-sensitive 
TRSs. Here each modeled step is adjoined with a small number of auxiliary steps. 

As a result state-of-the-art termination methods for context-sensitive rewriting become 
available for proving termination of outermost rewriting. Both transformations have been 
implemented in Jambox, making it the most successful tool in the category of outermost 
rewriting of the annual termination competition of 2008. 



1. Introduction 

Termination is a key aspect of program correctness, and therefore a widely studied 
subject in term rewriting and program verification. While termination is undecidable in 
general, various automated techniques have been developed for proving termination. One 
of the most powerful techniques is the method of dependency pairs [AGOOj. In |AGL06| 



dependency pairs for context-sensitive rewriting have been introduced, and in jAEF"'"08 the 



dependency pair framework ^GTSK04l IThiOTl iGLlOj has been extended to context-sensitive 
rewriting, thereby extending the class of context-sensitive TRSs for which termination can 
be shown automatically. Context-sensitive rewriting [Luc98] is a restriction on term rewrit- 
ing where rewriting in some fixed arguments of function symbols is disallowed. It offers a 
flexible paradigm to analyze properties of rewrite strategies, in particular of (lazy) evalua- 
tion strategies employed by functional programming languages. 

In this paper context-sensitive rewriting is the target formalism for a transformational 
approach to the problem of outermost termination, that is, termination with respect to 
outermost rewriting. Outermost rewriting is a rewriting strategy where a redex may be 
contracted as long as it is not a proper subterm of another redex occurrence. The main 
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reason for studying outermost termination is its practical relevance: lazy functional pro- 
gramming languages like Miranda |Tur86j . Haskell |PJ03] or Clean [PvEOlj . are based on 
outermost rewriting as an evaluation strategy, and in implementations of rewrite logic such 
as Maude |CELM96] and CafeOBJ [FN97J, outermost rewriting is an optional strategy. 

To illustrate outermost rewriting, and the transformations we propose, we consider the 
term rewriting system Rq consisting of the following rules: 

a ^ f(a) f(f(x)) ^ b (Ro) 

Clearly, this system is not terminating, as witnessed by the infinite rewrite sequence: 

a ^ f (a) ^ f (f(a)) ^ f(f (f(a))) ^ . . . 

However, Rq is outermost terminating. Indeed, the third step in the rewrite sequence above 
is not an outermost step, since the contraction takes place inside another redex. The only 
(maximal) outermost rewrite sequence the term a admits is: 

a 2^ f(a) 2^ f(f(a)) 2^ b (1.1) 

The contribution of the present paper consists of two transformations of arbitrary TRSs 
into context-sensitive TRSs (henceforth also called '/xTRSs') in such a way that rewriting 
in the /xTRS corresponds to outermost rewriting in the original TRS. As a result, advanced 
termination techniques for /iTRSs become available for proving outermost termination. 
Automated termination provers for /xTRSs can directly (without modification, only prepro- 
cessing) be used for proving outermost termination. One of the transformations turns out 
to be complete for the class of quasi- left-linear TRSs, a generalized form of left-linear TRSs, 
see |RZ09j . In other words, termination of the resulting /xTRS is equivalent to outermost 
termination of the original system. 

The transformations are comprised of a variant of semantic labeling [Zan95| . In se- 
mantic labeling the function symbols in a term are labeled by the interpretation of their 
arguments (or a label depending on these values) according to some given semantics. We 
employ semantic labeling to mark symbols at redex positions, and then obtain a //TRS by 
defining a replacement map that disallows rewriting inside arguments of marked symbols. 

We illustrate our use of semantic labeling by means of the TRS Rq given above. We 
choose an algebra with values and 1, indicating the presence of the symbol f: 

A = ({0,1},H) Ial = Ibl = IfK^) = l for XG {0,1} (A) 

We write f*, and say that 'f is marked', if the value of its argument is 1, and just f if the 
value is 0. The symbol a is a redex, and hence it is always marked, while b never is. If 
f is marked it corresponds to a redex position with respect to the rule f(f(x)) — )■ b. For 
example the term f(f(f(a))) is labeled as f*(f*(f(a*))). We obtain a //TRS by forbidding 
rewriting inside the argument of the symbol f*. Since a* is a constant, there is nothing 
to be forbidden. Hence for correctly labeled terms, rewriting inside redex occurrences is 
disallowed, and this corresponds to the strategy of outermost rewriting. 

In order to rewrite labeled terms we have to label the rules of a TRS. Simply labeling 
both sides of a rule does not always work. For example, when we label the rules of Rq using 
the algebra Aq, we obtain the following /xTRS: 

a*^f(a*) f*(f(x))^b f*(f^(x))^b (Ro) 

This system has two instances of the second rule, one for each possible value assigned to 
the variable x. Now, despite of the fact that the original TRS is outermost terminating. 



TRANSFORMING OUTERMOST INTO CONTEXT-SENSITIVE REWRITING * 



the labeled system \Ro\ still admits an infinite rewrite sequence: 

a* ^, f(a'^) ^, f(f(a*)) ^, f(f(f(a*))) ^,... (1.2) 

The reason is that the term f(f(a*)) is not correctly labeled, as its root symbol f should have 
been marked. In [Zan95j this problem is avoided by allowing labeling only with models. 
Roughly speaking, an algebra is a model for a TRS R if left and right-hand sides of all 
rewrite rules of R have equal interpretations. However, this requirement is too strict for 
the purpose of marking redexes, because contraction of a redex at a position p may create 
a redex above p in the term tree, as exemplified by ()1.2p . In fact, for Rq there exists no 
model which is able to distinguish between redex and non-redex positions. Let us explain. 
The rewrite step f (a) — > f (f(a)) creates a redex at the top. The term f (a) is not a redex, and 
therefore its root symbol f should not be marked. On the other hand f(f (a)) is a redex and 
so the outermost f has to be marked. The change of the labeling of a context (here f(n)) 
implies that the interpretation of its arguments a and f(a) cannot be the same. Therefore 
we cannot require the rule a — )• f(a) to preserve the interpretation. 

To that end, we generalize this notion of model and relax the condition [[£]] = [rj to: 

3n. lC[i]j = lC[r]j for ah contexts C of depth > n (1.3) 

Thus rules are allowed to change the interpretation as long as the effect is limited to contexts 
of a bounded depth. We call this depth the C -depth of i ^ r and denote it by 6_/x{i — > r). 
As it turns out, algebras satisfying this weaker requirement ()1.3p . are strong enough to 
recognize redex positions. Such algebras we will call C -models. 

The algebra Aq given above is a C-model for the TRS Rq. As opposed to models, for 
C-models it is no longer sufficient t o si mply label the rules. This is demonstrated by the 



rewrite sequence ()1.2p in the /xTRS Rq an application of the rule a* — > f (a*) in the term 
f(a*) creates the incorrectly labeled term f(f(a*)). 

Therefore, in order to preserve correct labeling, labels in the context of the original 
rewrite step sometimes have to be updated. We present two solutions to this problem: the 
transformation of context extension and the transformation of dynamic labeling. 

In the transformation based on context extension |EH09j , worked out in Section O the 
update of semantic labels is established by prefixing appropriate contexts to both sides of a 
rewrite rule. The depth of these contexts is bounded by the C-depth of the rule. Thus, the 
update of the labels is coded within the context, and no additional rewrite steps are needed. 
As a result of that, every outermost rewrite step in the original system is modeled by exactly 
one rewrite step in the transformed /xTRS. A disadvantage of the transformation, however, 
is that the resulting ^TRS can have a large number of rules arising from the prepending of 
contexts in combination with semantic labeling. 

An alternative solution (and new with respect to [EH09J ) is dynamic labeling, described 
in Section [6) instead of extending rules with contexts we now use rewriting to propagate 
the changed information upward in the term tree. With respect to context extension, this 
approach results in a smaller number of rules of the transformed system. On the other hand, 
the property of the context extension of a one-to-one correspondence of the rewrite steps, 
is now weakened to a one-to-m correspondence where m < 1 -\- Sji{i — t- r). This means 
that an outermost rewrite step is modeled by one step in the transformed system plus a 
number of auxiliary steps necessary for updating the labels, and this number is bounded by 
the C-depth of the corresponding rule. In most practical cases this value is typically small 
(< 2). This is shown in Section [TOl where we evaluate the implemented transformations. 
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We illustrate the two transformations by means of our running example, the TRS Rq 
together with the algebra ^o which forms a C- model for Rq. The algorithm based on 



context extension transforms Rq into the following /zTRS ^Rq . which truthfully simulates 
outermost rewriting in i^o^ 

f(a^) ^ f^(f(a^)) top(f^(f(x))) ^ top(b) 

top(a*)^top(f(a*)) top(f*(f^(x)))^top(b) ^ °^ 

The rule f(a*) —?■ f*(f(a*)) is obtained from prepending the context f(n) to a ^ ^(s)- This 
enables correct updating of the labeling of the context during rewriting. Because we still 
have to allow rewrite steps a — >■ f(a) of the original TRS at the top of a term, we extend 
the signature with a unary function symbol top which represents the top of a term. Thus 
when prepending contexts we include top(n), giving rise to the rule top(a*) — )• top(f(a*)). 
The necessity of the symbol top becomes apparent especially when we consider the rule 
f(f (x)) — 7> b. Here prepending the context f (□) is not even an option since f(f (f(2;))) -^ f(b) 
is not an outermost rewrite step; this rule can only be applied at the top of a term. Hence 
we get the two rules displayed on the right, one for each possible interpretation of the 
variable x. 

The second algorithm we define, that of dynamic labeling, transforms Rq (using ^o) 
into the following /uTRS , which we denote by 't^ Rq '■ 

a* -^ relabel°'i(f(a*)) f(relabel°'i(x)) -^ f*{x) 

f^(f(x)) -^ relabel^'°(b) top(relabel°'Hx)) -^ top(x) (ri?o) 

f*(f*(x)) -^ relabel^'°(b) top(relabel^'°(x)) -^ top(x) 

where rewriting beneath the redex symbol f* and the symbols relabel ' and relabel ' is 
disallowed. Displayed on the left, we recognize the original rules. Since left and right- 
hand side of the original rule a — >■ f(a) have distinct interpretations (0 and 1), in "I^Rq the 
right-hand side is prefixed with the symbol relabel ' . By application of the relabeling rules 
(displayed on the right), this symbol moves upward to take care of the update of labels 
in the context of the original rule application. Likewise, the rule f(f(2;)) -^ b (of which 
there are two versions in t'^-^O) one for each value x can be assigned to) means a change of 
interpretation, and relabeling the context is necessary. In this example, each original step is 
accompanied by exactly one relabel step. The relabel symbols dissolve after one such step. 
The only (maximal) rewrite sequence from the term top(a*) in t'^-Ro is: 

top(a'^) ^^ top(relabe|0'i(f(a'^))) ^^ top(f(a*)) ^^ top(f(relabe|0'i(f(a*)))) 

^^ top(f*(f(a'^))) ^^ top(relabe|i'0(b)) ^^ top(b) 

Notice the correspondence with the outermost rewrite sequence (jl.ip . 

Clearly, semantic labeling increases the number of rules and the number of symbols 
of a TRS. This results in a larger search space for finding termination proofs, and hence 
may lead to exhaustion of time or memory resources. On the other hand, one can say that 
semantic labeling does not complicate termination proofs, in the sense that proofs for the 
unlabeled system carry over to the labeled one: whenever R' is a labeling of a TRS R and 
A = {A, [•]], )^,3) is a monotone S-algebra [EWZ08] which proves termination of R, then 
the extension of [•]] to the labeled signature S' by defining [f J = |f] for every f € S and 
label A, yields a monotone S'-algebra witnessing termination of R' . Apart from this, the 
labeled systems often allow for simpler proofs, because the enriched signature provides for 
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more freedom in the choice of interpretations, see |Zan95] . As the transformations presented 
here are based on a variant of semantic labehng, they inherit these properties from semantic 
labehng. 

The two transformations have been implemented by the first author in the termina- 
tion prover Jambox |End09] . Notwithstanding the increased number of rules by semantic 
labeling and context extension, Jambox performs efficiently on the set of examples from the 
Termination Problem Database (TPDB jTerOS) ). and was best in proving termination in 
the category of outermost rewriting of the termination competition of 2008 [TerOSj . see 
Section [lOl 

Related work. The first tool for proving outermost termination was Cariboo |FGK02t 
[GK09j . Cariboo is a stand-alone tool, and its method is based on induction. 

For the idea of a transformational approach to outermost termination in order to make 
use of the power of termination provers we were inspired by |RZ09] , which in turn is based on 
ideas in |GM04] . In |RZ09] the signature is enriched with unary symbols top, up, and down 
and the TRS is extended with 'anti-matching' rules such that down(t) is a redex if and only if 
t is not a redex with respect to the original TRS. The idea is that the symbol down is moved 
down in the term tree as long as no redex is encountered. Once a redex is encountered, 
a rewrite step is performed, and the symbol down is replaced by up, which then moves 
upwards again to the top of the term, marked by top. This transformation is implemented 
in the tool TrafO |RZ09] . participant in the termination competion of 2008 |Ter08) . 

Based on a similarly elegant idea, Thiemann |Thi09| defines a complete transformation 
from outermost to innermost rewriting, which is implemented in AProVE. For traversal to the 
redex positions, rules of the form down(isRedex(f(. . .))) — > f(. . . ,down(isRedex(. . .)), . . .) are 
used. In order to simulate outermost rewriting and to prevent from moving inside redexes, 
rules isRedex(£) — > up(r) are added for every rule £ ^ r oi the original TRS. Then, by the 
innermost rewriting strategy, the latter rules have priority over the traversal rules, whenever 
an original redex is encountered. 

The simplicity of both approaches is attractive, but the yo-yoing effect in the resulting 
TRSs makes that the original outermost rewrite steps are 'hidden' among a vast amount 
of auxiliary steps. This increases derivational complexity, and makes it hard for automated 
termination provers to find proofs for the transformed systems. 

The present paper is a modified and extended version of JEH09J . In particular, we 
introduce a novel approach for proving outermost termination: dynamic labeling (Section[6]). 
We stress that the number of extra relabeling steps introduced in the dynamic labeling of 
a system is typically small and bounded by the C-depth of the applied rewrite rule. 

2. Preliminaries 

For a general introduction to term rewriting and to context-sensitive rewriting, we refer 
to |Ter03j and |Luc98j . respectively. Here we repeat some of the main definitions, for the 
sake of completeness, and to fix notations. 

A signature S is a non-empty set of symbols each having a fixed arity, given by a 
mapping [j : S — )• N. We write jjf for the arity of f G S, and we define S„ = {f € S | (Jf = n}. 
Given S and a set X of variables, the set TiT,, X) of terms over S is the smallest set 
satisfying: X C r(S, X), and f(ti, . . . , t„) G T{T,, A') if f G S of arity n and ti G r(S, X) 
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for all 1 < i < n. We use x,y,z, . . . to range over variables, and write Var{t) for the set of 
variables occurring in a term t. 

The set of positions Pos{t) C N* of a term t € T(S, X) is defined as follows: Pos{x) = 
{e} for variables x £ X and Pos(f(ti, . . . ,tn)) = {e} U {ip \l<i<n, p£ Pos{ti)} for 
symbols f € Sn- We write root{t) to denote the root symbol (or variable) of t, and t\p for 
the subterm of t rooted at position p. Then root{t\p) is the symbol at position p in t. 

A substitution a is a map o" : Af — )■ T{Ti^X) from variables to terms. For terms 
t G T(S, X) and substitutions a, ta is inductively defined by xa = (j{x) for x € Af, and 
f(ti, . . . , t„)cr = f (tier, . . . , t„cr) for f G i;„, and ti, . . . , t„ G T(i;, A'). 

Let □ be a fresh symbol, i.e., □ E U -Y. A context C is a term from T(S, ,Y U {D}) 
containing precisely one occurrence of □. By C[s\ we denote the term Ca where cr(n) = s 
and (t{x) = x for all x £ X . We use C(S, A') to denote the set of contexts over S and X. 
We write Var{C) with (7 G C(S,^) to denote the set of variables of C excluding □. The 
depth of a context C is defined as the length \p\ of the position p at which □ resides, that 
is, the position p such that root{C\p) = □. 

A terrn rewriting system (TRS) over S is a finite set of pairs {i, r) G T(S, X) x T(S, X)^ 
called rewrite rules and written as £ — ?• r, for which the left-hand side i is not a variable 
(£ (Y) and all variables in the right-hand side r occur in t. Var{r) C Var{i). For a TRS R 
we define — t-r, the rewrite relation induced by R as follows: For terms s,t G T(S,Af) we 
write s -^r t, or just s ^^ t if i? is clear from the context, if there exists a rule i ^^ r (^ R, a 
substitution a and a context C G C(S, A") such that s = C[£a] and t = C[ra]; we sometimes 
write s -^R,p r to explicitly indicate the rewrite position p, i.e., when root{C\p) = □. A 
term of the form ia, for some rule i —^ r G R, and a substitution a, is called a redex. 

For terms s and t, we say that s outermost rewrites to t at a position p G Pos{s), 
denoted by s ^R^p t, if s -^R,p t and for all positions p' strictly above p (i.e., p' a proper 
prefix of p) we have that s\pi is not a redex with respect to R. 

A binary relation >- '^ Ax A over a set A is called well-founded if no infinite decreasing 
sequence oi ^ a2 ^ 03 >- . . . exists. A TRS R is called terminating or strongly normalizing, 
denoted by SN(i?), if -^r is well-founded. 

A mapping ^ : S — ?• 2 is called a replacement map (for T,) if for all symbols f G S 
we have /u(f) C {1, . . . , jjf}. When we define a replacement map ^, the case for constants 
a G S is left implicit, as we always have /u(a) = 0. A context-sensitive term rewriting 
system (fiTRS) is a pair {R, fJ-) consisting of a TRS R and a replacement map fi. The 
set of ^-replacing positions Pos^{t) of a term t G T(S,^) is defined by Pos^{x) = {e} 
for X G <Y and Pos^(f(ti, . . . ,t„)) = {e} \J {ip \ i ^ ^(f), p G Pos''(tj)} for f G S^ and 
ti, . . . ,t„ G T(S, A"). 

In context-sensitive term rewriting only redexes at /x-replacing positions are contracted: 
we say s ^-rewrites to t, and denote it by s — t-r,^ t whenever s -^R,p t with p G Pos'^{s). 
For instance, consider the system R consisting of the single rule: 

a -^ cons(b, a) 

and let /U be given by /u(cons) = {1}. Then, obviously R is non-terminating. On the other 
hand, the context-sensitive TRS {R, /u) is terminating, because the replacement map of the 
symbol cons allows rewriting only in its first argument. 

We conclude this section by defining some non-standard notions. 

Definition 2.1. A thin context is a context that has at every depth at most one symbol 
from S U {□}; all other symbols are variables. 
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Example 2.2. f(f(x,g(n)),y) is a thin context, whereas f(g(n), h(a;)) is not, since g and h 
are at the same depth. 

Definition 2.3 ( |SM08] ). A flat context is a context C € C(S, X) of the form: 

C = \[Xi, . . . , Xj — i, □, Xj^i, . . . , Xn) 

where f G S„ with n > and xi,X2, • • • € <Y are pairwise distinct variables. For a term 
t G T(i;, X) we say that C is fresh for t if Var{C) D Var{t) = 0. We denote the set of flat 
contexts fresh for t by Cj (S, X). 

Definition 2.4 ( |RZ09) ). A TRS i? is called quasi-left-linear if every non-linear left-hand 
side of a rule in R is an instance of a linear left-hand side from R. 

Example 2.5. The following TRS is quasi-left-linear (and outermost terminating): 

g(f(2;), x) -^ g(g(2;, x),x) g{x, y) ^ y 

3. Generalizing Models to C-models 

In outermost rewriting the only redexes which are allowed to be rewritten are those 
which are not nested within any other redex occurrence. We model this strategy by context- 
sensitive rewriting with the use of semantic labeling: we mark the symbols which are the 
root of a redex in order to disallow rewriting within that redex. We first recall the definition 
of semantic labeling and of models from [Zan95], and then generalize these to fit our purpose. 

Definition 3.1. A Ti-algehra A = {A, [•]) consists of a non-empty set A, called the domain 
of A, and for each n-ary symbol f E S a function |f]] : A" -^ A, called the interpretation 
of f . Given an assignment a : X ^ A of the variables to A, the interpretation of a term 
t e T{T,,X) with respect to a is denoted by lt,al and inductively defined by: 

lx,al = a{x) [f(ti, . . . ,t„),al = [fl([ti,al, . . . , [t„,a]) 

where x G X, f £ S„, and tj G T(S, A") for 1 < i < n. For substitutions a : X ^ T(S, X), 
we write |<t, aj for the function Ax.[[(T(x),a]. For ground terms t G T(S,0) and ground 
substitutions a : X ^ T(S, 0) we write ft} and |(t] for short. We usually write A for both 
the algebra and its domain, and we use [•] to denote the interpretation function of A. 

Lemma 3.2. Let A be a T,-algebra, a : X ^ A an assignment, and a : X ^ T(S, X) a 
substitution. Then, for all terms t G 7~(S, A:*).' Jto", a] = Jt, [o", aj] . □ 

For completeness of the transformation of context extension (Theorem I5.13P , it is im- 
portant that there exist no 'junk' elements in the E-algebra, that is, elements a for which 
there are no ground terms t such that |i]] = a. See Example 15.141 For that reason we 
restrict S-algebras to 'core' S-algebras whose domain equals the set of all interpretations 
of ground terms over S. 

Definition 3.3. The core of a T,-algebra A is the S-algebra Ac = {Ac, [[-Jc) where Ac is the 
least set such that [[f|(ai, . . . , a„) G Ac whenever f G Sn and ai, . . . , a„ G Ac, and where 
I'Jc is the restriction of fj to Ac. We say that A is core whenever A = Ac. 

By construction of the core of a S-algebra we then obtain: 

Lemma 3.4. For every element a a Ac of the core of a T,-algebra A there exists a ground 
term t G T{T,, 0) with {t} = a. D 



8 J. ENDRULLIS AND D. HENDRIKS 

Definition 3.5 ( [Zan95j ). A semantic labeling {A, it) consists of a S-algebra A and a family 
vr = {vTflfgs of labeling functions vTf : A^ — ?> Af where Af is a finite and non-empty set of 
labels for each symbol f G S. For a term t G T(S, X) and an assignment a : X ^ A, we 
define lab{t,a), the labeling of t with respect to a, inductively as follows: 

lab{x,a) = X 

lab{f{ti, . . . ,tn),a) = f {lab{ti, a), . . . , lab{tn,a)) 

where x e X, f e S„, ii, . . . , t„ G T(S, Af) and A = 7rf([ti, aj, . . . , {tn, a}). 

Let Rhe a TRS over S. The semantic labeling of R is the TRS lab{R) over the labeled 
signature labijl) = {f | f G S, A G Af}, defined by: 

/a6(i?) = {lab{l, a) -^ lab{r, a) \ £ -^ r e R, a : Var{e) -^ A} 

For a substitution a : X ^ T{T,,X), and an assignment a : X —?■ A, we write lab{a,a) 
for the function Ax./a6(cr(j;),a). For ground terms t G T(S, 0) and ground substitutions 
a : X ^ T(S, 0) we write lab{t) and lab{a) for short. 

Term labeling satisfies the following useful property: 

Lemma 3.6 ( |Zan95j ). Let A he a Ti-algehra, let a : X ^ A be an assignment, and let 
a : Af — )• T(S, X) be a substitution. Then, for all terms t G T(S, X) it holds that: 

lab{ta, a) = lab{t, {a, a}) lab{a, a) 

Proof. Direct by induction on the term structure, and Lemma 13.21 □ 

The S-algebra of a semantic labeling has to satisfy certain constraints in order to obtain 
that a TRS is terminating if and only if its labeled version is. In [Zan95] the algebra has to 
be a 'model': 

Definition 3.7. A E-algebra A is called a model for a TRS R if for all rules £ ^ r £ R and 
assignments of variables in the left-hand side a : Var{i) — >■ ^ we have that [£, aj = [r, a}. 

In the introduction we argued why this notion of model is too restrictive for the purpose 
at hand. In order to be able to distinguish between redex and non-redex positions we 
introduce C-models, a generalization of models. 

Definition 3.8. A C-model for a TRS R over S is a S-algebra A where for each rule 
i —^ r £ R there exists an n G N such that for each context C of depth n and assignment 
a : A' — >■ ^ we have [[C[£],a]] = [C[r],a]]. When n G N is minimal for a rule i ^ r with 
respect to this property, we call n the C-depth for i ^ r, and denote it by 5jx{£ — )■ r). The 
C-depth for R with respect to A, denoted by Sj{{R), is defined as the maximal C-depth of 
the rules of R: d^iR) = max {6^{i -^ r) \ i ^ r e R}. 

Example 3.9. Let Ri he the following TRS over S = {c, f, g} (where c is a constant): 

f(g(x))^f(f(g(x))) f(f(f(^)))^^ (i?i) 

The algebra Ai = {.L, f,jj,g} with the interpretation function defined, for all x & Ai, by: 

14 = ± iqi±) = iq(g) = f iqif) = ifi(ff)=ff M{x) = g {Ai) 

forms a C-model for Ri. The C-depth of the rule f(g(a;)) — > f(f(g(x))) is 1; both contexts 
f(n) and g(n) make that left and right-hand side of the rule have equal interpretations, 
respectively ff and g, regardless of the value we assign to the variable x. For the other rule 
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the C-depth is 2: If the interpretation of a; is _L or (7 then f (□) is not yet enough to interpret 
both sides by the same element of the algebra; an additional context f(n) or g(n) has to 
be wrapped around. Thus the C-depth of the TRS is 5^j(i2i) = 2. 

We now define semantic labelings based on C-models, to which we refer as 'C-labelings'. 
We define a C-labeling over an extended signature Stop = S U {top}. The symbol top 
represents the top of a term, and we assume top to be fresh for S, i.e., top S. Moreover, a 
C-labeling includes a set S C lab{Tj) of 'redex symbols', the set of symbols below which 
rewriting should be forbidden. For example, for a sound transformation from outermost to 
context-sensitive rewriting it has to be guaranteed that in a well-labeled term the symbols 
from S'"^'^ occur at redex positions only. 

Definition 3.10. Let i? be a TRS over S. A C-labeling for R is a tuple {A, vr, S ) where 
^ is a C-model for R, {A,tt) is a semantic labeling over the signature Stop = S U {top}, 
and S C lahiTi) is a subset of the labeled signature. We fix the interpretation [topj of 
top to be an arbitrary constant function Ax. a for some a ^ A. 
A C-labeling (>l,7r, S^""') for R is called: 

(i) sound if root{lab{t)) € S implies that t is a redex with respect to R, for all ground 

terms t G T(S,0); 
(ii) complete if root{lab{t)) G S whenever t is a redex with respect to R, for all ground 

terms t G 7'(S,0); 
(iii) maximal if 7rf(ai, . . . , a„) = (ai, ...,an), for all symbols f G S„ and all values 

CL\ , . . . , Clji G ^ , 

(iv) core if the S-algebra A is core. 

Remark 3.11. From Definition 13.101 it follows that a C-labeling (^, vr, S'"'^'^) for R is: 

— sound if and only if root {lab {t)\p) G S implies that t\p is a redex with respect to R, 
for all ground terms t G T(S, 0) and all positions p G Pos{t), and 

— complete if and only if root [lab {t)\p) G S whenever t\p is a redex with respect to R, 
for all ground terms t G T(S, 0) and all positions p G Pos{t). 

Example 3.12. We continue with Example 13.91 where we defined a C-model ^1 for the 
TRS Ri. We let {AijTt) be the semantic labeling where vr labels each symbol with the 
interpretation of its arguments. The set of redex symbols is defined by S = {f^,f^}. 
These symbols correspond to redex positions with respect to the first and the second rule 
of i?i. Then (^1, vr, S ) forms a sound, complete, maximal, core C-labeling for Ri. 

We explain why we fix the interpretation [topj to be a constant function. First note 
that if [topj is a constant function, then the extension of the signature with top does not 
interfere with the property of A being a C-model for R. Second, the transformation given in 
Section [5] extends the rules with contexts until the interpretations of the left and right-hand 
side are equal. If [top| is constant, then the extension halts at the symbol top, that is, no 
further contexts are prefixed to top. This corresponds to the intuition of top representing 
the top of the term. Moreover, it is not important which constant function Xx.a we choose 
for [topj: as no symbols will be prefixed to top, no symbol will be labeled with its value. 

Remark 3.13. The transformations defined in Sections [5] and [6] are sound whenever we use 
a sound C-labeling. This means that termination of the target system implies outermost 
termination of the original system. On the other hand, using a complete C-labeling does 
not guarantee completeness of either transformations. More precisely, using a complete 
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C-labeling does not imply that the transformed system is terminating whenever the origi- 
nal system is outermost terminating. A complete C-labeling guarantees that in a correctly 
labeled term all redex positions are marked, so that only outermost steps are possible. But, 
for the transformation to be complete we need two more properties. First, rewriting needs 
to preserve correct labeling of terms. Secondly, for the labeled system, global termination 
of all terms (including the not correctly labeled ones) should be equivalent with local termi- 
nation [EdVWOO] of the well-labeled terms. This point is of practical importance because 
the state of the art of automated analysis for global termination is far more advanced than 
for local termination. Both properties do in general not hold for complete C-labelings. 

Remark 3.14. Maximal C-labelings can be defined in a more general fashion by requiring 
TTf (a ) 7^ VTf (6 ) for all f € $]„ and all a,b € A^ with a ^ b . The important point is that the 
value of all arguments can be inferred from the label. For the sake of a simple presentation 
we stick to the definition where labels are tuples of argument values. 

4. Static Context Extension 

In this section we describe a naive approach for semantic labeling with C-models. This 
serves both as an introduction and as a motivation for the transformations that we present 
in Sections [5] and [6l Input for these transformations is a TRS together with a C-model for 
this TRS. In Sections [THS] we explain how C-models are constructed. 

As can be inferred from Definition [3]8l it is possible to transform a TRS R by prepending 
contexts to its rules in such a way that its C-model A becomes a inodel for the transformed 
system R, and then apply the usual semantic labeling to R. We call this transformation 
'static context extension', as opposed to the transformation of 'dynamic context extension' 
presented in the next section. In the dynamic version, contexts are prepended only when 
needed and dependent on the values assigned to the variables in the rules. 

Since every context is an instance of a thin context (Definition [24]) with the same depth, 
rules are prefixed by thin contexts, in both versions of context extension. 

Definition 4.1. Let i? be a TRS and (^, 7r,S ) a C-labeling for R. The static context 
extension of R with respect to (^,7r, S'''^'^), is the /iTRS {^R,fJ.) where /^R is the TRS 
resulting from the steps listed below and where the replacement map fi is defined by /u(f) = 
if f G S^*^*^, and ^(f) = {!,..., ftf} otherwise, for all f G k6(Stop). 
(i) Replace each rule i ^ r with C-depth n by the set of rules: 

(a) C[£] -^ C[r] for each thin context C of depth n, and 

(b) top(C[^]) -^ top(C[r]) for each thin context C of depth < n. 
We let R denote the union of these sets. 

(ii) Apply semantic labeling to R using the C-labeling (.4, tt). We obtain lab{R). 
(iii) Remove from lab{R) all labeled rules that contain a redex symbol from S in the 
prepended context. The TRS thus obtained is denoted by M'R. 



Note that the rules from item i(b) of Definition 14. II model the application of an original 
rule at a depth smaller than its C-depth, that is, 'near' the top of the term. The created 
rules are hence wrapped into top(n). 

Example 4.2. We illustrate the transformation by static context extension on the TRS Ri: 

f(g(x))^f(f(g(x))) f(f(f(x)))^x (i^i) 
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together with the C-model Ai = {-L, f,ff,g} introduced in Example 13.91 and the C-labehng 
{Ai,7r,T.'"'"^) defined in Example [3l2l 

The first step of the transformation yields Ri which consists of the following 10 rules: 

C[f(g(x))] ^ C[f(f(g(x)))] D[fim^)))] ^ D[x] (i?i) 

where C G {top(n),f(n),g(n)}, and D G {top(n), top(f(n)),top(g(n)),f(f(n)),f(g(n)), 
g(f(n)),g(g(n))}. Note that the algebra Ai extended with the symbol top and interpre- 
tation [[top](x) = ± for all X G >li, is a model for the TRS Ri (in fact, any value for the 
interpretation of top will do). 

The second step is to label the TRS Ri. This yields lab{Ri) consisting of 4- (3 + 7) =40 
rules, four instances for each of the ten rules, one for each value that can be assigned to x. 

The final step is to remove from lab{Ri) each rule which contains a redex symbol 
within the context that was prepended in the construction of Ri . Such a rule would enable 
a rewrite step which is not outermost. Of the 40 rules of above, 12 have to be thrown out. 
This concerns the labelings of the second rule of Ri where f(f (D)), top(f (□)) or g(f(n)) has 
been prepended. They contain the redex symbol f-" in the prepended context. For instance, 
if we prepend the thin context f(f(n)) to the rewrite rule f(g(x)) — > f(f(g(x))) and then 
perform semantic labeling, we obtain the following rule in lah[Ri): 

fi5'(f^(f3(g^(x)))) ^ f^(f^(f^(f^(g^(x))))) (4.1) 

This rule (with the redex symbol f-^ in the prepended context) has to be discarded as it 
would admit the following infinite rewrite sequence, altough Ri is outermost terminating: 

top/(f^(g^(c))) ^, topB{^f{^^{g^{c)))) ^, top^(f^(f^(f^(g^(c))))) 

^ top^(fif (fif (f/(fP(g^(c)))))) ^^ top/(f^(g^(c))) ^,... 

We note in advance that the dynamic context extension A^Ri of Ri, worked out in 
Example 15. 11 consists of 19 rewrite rules, whereas the static version ^Ri above consists of 
28 rules. 

In general, the presence of a redex symbol may depend on the interpretation of the 
variables. This is better demonstrated by the following example. 

Example 4.3. Consider the TRS over the signature {c, f, g} (with c a constant): 

g(f(g(x)))^f(g(g(f(x)))) f(x)^X (i?2) 

We use the C-model A2 = {-\-,g,fg} with the interpretation of the symbols defined by: 

[cl = ± M{x)=g m{g)=fg m{±) = mifg) = ± {A2) 

for all X G A2- Again we use maximal labeling so that the symbols g-*^, f , f^ and f^^ 
correspond to redex positions. The C-depth of the rule f (x) — >■ x is 2 and its static context 
extension contains the rule g(g(f(x))) — )■ g(g(x)). From this we obtain three labeled rules: 

g5(g^(f^(x))) ^ g^(g^(x)) for a(x) = ± 

gff(gA(fS(2,))) ^ g3(g3(x)) for a(x) = g 

gS(g^(f^^(x))) ^ g^(g^^(x)) for a(x) = fg 

The second rule should not be allowed, as it would enable a rewrite step that is not outer- 
most. This is witnessed by the symbol g-^^ in the prepended context. 
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5. Dynamic Context Extension 



We present an approach for semantic labeling with C-models, called 'dynamic context 
extension' 1^ where we stepwise extend rules by contexts, only when needed and dependent on 
the variable interpretation used for the semantic labeling. For different interpretations of the 
variables usually different context depths are necessary for achieving equal interpretations 
of left and right-hand side. In each extension step we check whether a candidate symbol 
is a redex symbol, and, if it is, this symbol is excluded from prepending. Here, by a redex 
symbol we mean a labeled symbol which indicates the presence of a redex in the original 
system. Dynamic context extension is more efficient in the sense that both the number and 
the size of the rules of the resulting /iTRS are smaller than in the static version defined in 
the previous version. 

The transformation starts with constructing pairs {I -^ r, a) of rules and variable 
assignments. Then these rules are extended with flat contexts until the interpretations of left 
and right-hand side are equal. Finally, each obtained rule is labeled using the corresponding 
interpretation. More precisely, we implement this process as follows. 

We iteratively construct sets Po,Pi, . . ., until Pj+i = Pi for some i. The initial set Pq 
consists of pairs {i -^ r, a) for each rule i —^ r, and each interpretation a : Var{i) -^ A 
of the variables. Then, in each step, Pj+i is obtained from Pi by replacing every pair 
(£ — )• r, a) of Pi for which the interpretation of the left-hand side differs from the right-hand 
side (l£, a} ^ [r, aj), by the pairs {C[£] -^ C[r],a') for every flat context C (Definition 12. 3p 
and every extension a' : Var{C[£]) — > ^ of a, such that the root of the labeled, extended 
left-hand side lab(C[£],a') is not a redex symbol. Among the flat contexts to be prepended 
we include top(n) to cater for the case that the rule is applied at the top of the term. 

Example 5.1. We reconsider from Examples 13.91 and 13.121 the term rewriting system Ri: 

f(g(x)) ^ f(f(g(x))) Hm^))) ^ ^ (Ri) 

together with the C-labeling (^i,7r, S'^'^'^), where Ai = {.L, f,jj,g}, vr labels symbols with 
their arguments and S'"'^'^ = {f^,f-ff}. The initial set Pq of pairs (rule, assignment) is: 

Po = {{Hg{x)) ^ f(f(g(x))), Xx.a) , (f(f(f(x))) ^ X, Xx.a) \ a e Ai} 

The only element {i -^ r,a) of Pq such that [[^, a] = [r, aj is (f(f(f(x))) — > x,Xx.ff). For 
this pair no context needs to be prepended. The other pairs have to be replaced by their 
context extensions and thus Pi consists of the following (4 •3 + 1 + 3-2 = 19) pairs: 

(C[f(g(x))] ^ C[f(f(g(x)))], Xx.a) for aU a e Ai, C e {top(n),f(n),g(n)} 

(f(f(f(x)))^x,Ax.#) 

(C[f(f(f(x)))] ^ C[x],Xx.a) for aU a /#, C G {top(n),g(n)} 

In the last line the context !(□) is excluded, because the labeled left-hand side of the rule 
would contain the redex symbol f-^ within the prepended context, and thus the step would 
not be outermost. Because of the outermost strategy, the original rule is only applicable in 
a context C[g(n)] (where C does not contain any redexes) or at the top of a term. Now 
for all rules in Pi the left and right-hand side have equal interpretations, and hence the 
iterative construction is flnished. 



In [EH09] we used the term 'dynamic labeling' for what we here call 'dynamic context extension'. The 
term 'dynamic labeling' is now reserved for the transformation that we define in Section [6] 
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Secondly, the obtained set Pi is labeled using the family vr of labeling functions. The 
desired context-sensitive TRS A^Ri then consists of the rules lab{£,a) — > lab{r,a) for 
every (£ — )• r, a) € Pi, with the replacement map /x defined by/x(h) = 0ifhG {f^, f-^}, and 
/i(h) = {1, . . . , j|h} otherwise, for all h G lab{T,). Thus the dynamic context extension of Ri 
consists of 19 rules. Recall from Example 14.21 that the static context extension of Ri had 
28 rules. 

We now formalize this transformation. 

Definition 5.2. Let R he a TRS over S, and (^,7r, S*^*^^) a C-labeling for R. We define 
P'"{R) as the least fixed point of the following construction of sets Pq,Pi,. . . , that is, 
P^{R) = Pi as soon as Pj+i = Pi for some i. The initial set Pq is defined by: 

Po = { {£ ^ r,a) \ i ^ r e R, a : Var{i) -^ A } 

and for i = 0, 1, . . . the set Pj+i is obtained from Pi by replacing every pair {i ^- r,a) such 
that p, aj 7^ |r, a], or r G Af, by all pairs in A(£ — >■ r, a) where we define: 

A{i^r,a) = {{C[i]^C[r],a + (3) \ C G C^(Stop, <1^), /3 : Var(C7) ^ ^ 

rooi(/a5(C[£],Q + /3)) S'''^'^ } 

Here, for partial functions / and g with disjoint domains, we write f + g for the function 
defined by (/ + g){x) = f{x) if x E dom{f), and (/ + g){x) = g{x) if x G dom{g). 

The construction of P'^{R) is guaranteed to terminate because of the assumption that 
^ is a C-model for R. 

Definition 5.3 (Dynamic context extension). Let i? be a TRS over S, and (^, vr, E'"'^'^) a 
C-labeling for R. The dynamic context extension of R with respect to {A, vr, S'"'^'^) is the 
/iTRS {A^R, fi) consisting of: 

A^P = { lab{i, a) -^ lab{r, a) \ {i ^ r, a) G P^{R) } 

and the replacement map fi, defined by ^(f) = if f G S*"^*^, and /x(f) = {!,..., (jf} 
otherwise, for all f G /a6(Stop). Whenever the set S'"'^'^, which determines the replacement 
map, is clear from the context, we write A^R as a shorthand for {/^R, /i). 

Remark 5.4. In the transformation given in Definition 15.31 collapsing rules are always 
prepended by at least one fiat context. Consequently, {A^R, fj.) contains no collapsing 
rules. This is used in the proof of Theorem 15.131 in order to apply Theorem 15.121 Without 
this elimination of collapsing rules, the transformation is still sound (Theorem 15. Sh . Note 
that in the TRS Ri worked out before, we did not eliminate the collapsing rule. 

Let us work out another example. 

Example 5.5. We consider problem zantema08/dupl_rhs.trs from the TPDB [TerOSj : 

f(h(x),c) ^f(i(x),s(x)) i(x)^h(x) 

f(i(x),2/)^x h(x)^f(h(x),c) ^ '^ 

We denote this TRS by Rs, and take the algebra As = ({-L, c, h, i}, |-]]) with |-] defined by: 

M = c M (x) = h H (x) = i m (x, y) = [si (x) = ± 

for all x,y €z A3. Furthermore, we employ minimal labeling; only the function symbols that 
are at the root of a redex occurrence are marked. Thus the symbols h, i are always marked: 
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7rh(2;) = 7ri(3;) = *. We let TTf{i,x) = 7rf(/i, c) = *, and leave f unmarked otherwise. Also, 
the symbols s and c are never marked. 

The dynamic context extension A^R^ is then formed by the rules: 



f*(h*(x),c)^P(i*(x), 


s{x)) 


s{ri\*ix),y))^six) 




fiz,r{\''{x),y))^f{z,x) 




f(z,P(i'^(x),y))^P(z,x) 




fr(\*{x),y),z)^f{x,z) 




f(P(i*(x),y),z)^P(x,z) 




top(f*(i'^(x),y))^top(x) 





s(\^x)) 


^ s(h*(x)) 


f(y,i^(a;)) 


^f(y,h*(x)) 


top(i*(x)) 


^top(h'^(x)) 


s(h*(x)) 


^s(f^(h*(x),c)) 


HyXi^)) 


^f(y,P(h'^(x),c)) 


f(h^(x),y) 


^f(P(h'^(x),c),y) 


top(h*(x)) 


^top(f*(h*(x),c)) 



(A-i?3) 



We now work towards the first main theorem, stating that outermost ground termina- 
tion of R is implied by termination of the transformed system A^R. 

Lemma 5.6. Let R be a TRS over S, and let (^, vr, S'"^'^) be a sound C -labeling for R. 
Moreover, let s,t a T(S, 0) be ground terms and p E Pos{s) such that s ^R,p t. Then for 
all proper prefixes q of Ip we have root {lab {X.o\i{s))\q) Ti . 

Proof. For q = e this follows from top'^ ^ S^'^'^ for any label A. Otherwise we have that 
root{lab{top{s))\q) = root [lab {s)\qi) with q' a proper prefix of p, and if root{lab{s)\q') € E'"'^'', 
then, by definition of sound C-labeling, s contains a redex at position q' , quod non. □ 

The following lemma states that any outermost ground rewrite step in R can be trans- 
formed into a rewrite step in A^R. 

Lemma 5.7. Let R be a TRS over S, and let {A, vr, T,^'^'^) be a sound C-labeling for R. Let 
s,i G T(S,0) be ground terms such that s ^/j t. Then: 

lab{top{s)) ^A^R,t^ lab{top{t)) 

Proof. Assume s ^R,p t for some position p € Pos{s). Then there exists a rule i ^ r G R, 
a context C with root{C\p) = □ and a ground substitution a such that s = C[ia] and t = 
C[ra]. We consider the construction of the dynamic context extension from Definition 15.31 
and prove by induction that for all i = 0, 1, . . . there exists a context d which is a prefix 
of top(C), a ground substitution at, and terms ii, ri such that top(s) = Ci[^jcrj], top(t) = 
Ci[riai] and (£i — )• r^, \ai\) G Pi. For the base case we have (£o — ^ ^Oi [coD G -Po with Iq = i, 
ro = r, (Jo = a, and Cq = top(C). For the induction step we assume the existence of Cj, cjj, 
and {ii -^ ri, [o-jj) € Pi with the above properties. If [£«, [[o"j|]] = [rj, [ctj]]]] and ri ^ X then 
by definition (^j -^ ri, {ai}) G -Pi+i, and so we are done. For the remaining cases {ii, [aj]] ^ 
In, laijj and r^ G A", we first show that d + □. If lU, lai\\ / [r^, lai\\ and Ci = □, 
then iiOi = top(s) and rjcjj = top(t), and hence root{ii) = root{ri) = top, contradicting 
I^j! [fill / l^i) [fill (recall that the interpretation of top is constant). Furthermore, we 
have ri G X only if i = 0, and then Ci = top(C) ^ □. Thus we have Cj = D[D'a'] for some 
context D, flat context D' G C\_ and substitution a' . We choose Cj+i = D, £j+i = D'[i.i], 
Tj+i = -D'[rj], and (Tj+i = (Tj -|-cr'. It remains to be shown that (^j+i -^ ?'i+i, [[o"j+i|) G i^i+i. 
For this it suffices to prove that root {lab {£i+i, [o'j+iD) ^ S'''^'^. We have Cj+i[^j+iO"j+i] = 
top(s). Let g be the position such that root(Cj+i|g) = □. Then, by Lemma 13.61 we obtain 
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rooi(/a5(£i+i, Juj+i]])) = root (lab {ii^iai^i)) = root(top{lab{s))\q). Note that g is a proper 
prefix of Ip. By Lemma 15.61 we fiave root {lab {top{s))\q) S . 

Let i be such that Pj+i = Pj. By the result above we have {£i — ^ r^, |o"i]]) € P 
with [[£j(tJ = [rjO-j]], and /a6(£j, [aj) — >■ k6(ri, |(tJ) G A'^i? by definition. Let r and 
V be defined by r(n) = £j(Tj, t;(n) = tjo-j, and r(x) = f (x) = x for x € <Y. Then 
we have that lab{Ci,lTJ) = lab{Ci, Ivj) since |t] = {vj. Let S = k6(Ci,[T]). We get 
/a&(top(s)) = lab{Ci[iiai]) = labidr) = Elab{T) = E[lab{(.iai)] = E[lab{li,lai\)lab{ai)] 
and /a6(top(t)) = . . . = E[lab{ri, lail)lab{ai)], by Lemma [321 By Lemma \5M all symbols 
above position Ip in the term lab{top{s)) are not in S'"'^'^ and hence we have a /Lt-rewrite 
step: /a6(top(s)) -^A^R,fi lab{top{t)). D 

Theorem 5.8. Xei R be a TRS over S, and (^, vr, E'"'^'^) a sound C -labeling for R. Then 
R is outermost ground terminating if A^R is terminating. 

Proof. Assume that R admits an infinite outermost rewrite sequence: 

, O'id , O'id , out 

tl ->R 1,2 -^R ts -^R . . . 

Then from Lemma 15.71 it follows that A^R admits an infinite rewrite sequence: 

/a6(top(ti)) -^A^R,^i lab{tQp{t2)) ^a-r,/. lab{top{ts)) ^A-ij.^ • • • □ 

The following three examples illustrate why our method is sound, but not complete 
when applied to non-left-linear (and non-quasi-left-linear) TRSs. The first example can be 
handled by our approach employing the C-labeling constructed in Section |9l The second 
example fails using the C-labeling from Section [9l but can successfully be treated using 
a manually constructed C-labeling. For the third example, we show that there exists no 
C-labeling that can be employed for proving outermost ground termination; this example 
is out of reach for the approach proposed in this paper. 

Example 5.9. We consider the non-left-linear TRS R^ with three rules: 

g(x, x) -^ f (f(x, x), x) f (x, x) -^ g(x, x) f(x, y) ^y (P4) 

over the signature S = {f , g, a} where a is a constant (necessary for the existence of ground 
terms). We choose the algebra A4, = {±} with [aj = _L, [fK-L, -L) = -L, and [gK-L, -L) = -L. 
We label the symbols with the interpretations of their arguments, and define S*"^*^ = {f-*-'-*-}. 

Note that E'"'^'^ does not contain g"*"'"*". The reason is that using a finite algebra we can 
(in general) not recognize redex positions with respect to non-left-linear rules. By excluding 
g ' from S we allow rewriting below g even when g is the root of a redex. This is sound 
for proving outermost termination as it does not restrict the possible rewrite steps, but 
allows only additional steps. The symbol f-*-'-*- is part of S*"^*^; due to the rule f(x,y) — ?■ y 
each occurrence of f is a redex position. 

The dynamic labeling I^R^ is then formed by: 

g^,^(^,^)^f^,^(f^^(^,^),^) 

f^'^(x, x) -^ g^'^(x, x) {A'^Ri) 

f-^'-^(x,2/) -^ y 

where /i(f"'"'"'") = and ii{g-^'-^) = {1,2}. This system is terminating which can be seen as 
follows. After an application of the first rule: 

C[g^'^(t,t)] ^A^R,,^ C[f^'^(f^'^(t,t),t)] 
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the replacement map /i prevents us from reducing the inner f ' . Moreover, the second 
rule cannot be applied to the outer f ' since the left and the right subterm are not equal. 
Thus the only rule applicable to the displayed subterm is i-^'^{x,y) — > y which reduces the 
size of the term, and we can conclude termination by induction. 

Hence we conclude outermost ground termination of i?4 by Theorem 15. 8i Actually the 
same C-labeling allows also to infer outermost termination, see Lemma [5.15l (we simply add 
a fresh constant and a unary symbol s with interpretations JO] = _L and [[s](-L) = -L). 

Example 5.10. We consider the non-left-linear TRS R^ over the signature S5 = {g, a, b}: 

a^g(a,a) g(x,x) ^ b {R5) 

This TRS is outermost terminating. However, there exists no C-labeling that recognizes 
redex positions with respect to the non-left-linear rule g(x, x) — >■ b. A finite algebra cannot 
be used to check whether two arbitrary subterms ti and t2 of g(ii,t2) are equal. Thus 
it appears that, in order to have a sound transformation, we cannot include any symbol 
g in the set S of redex symbols. But then rewriting below g is allowed, and the rule 
a — )■ g(a, a) would lead to non-termination of the dynamic labeling /^R^. 

Nonetheless, in this particular example, the problem can be solved. If some element e 
of the algebra is the interpretation of precisely one ground term t, then, of course, [tij = 
P2I = e implies that ti = t2- Let us take the algebra A5 = {-L,a} with [aj = a, [bj = _L, 
and [gK^^)!/) = -L for all x,y G A5. We use maximal labeling and define S'"'^'^ = {g°'"}. 
That is, we mark redex positions g{t,t) only for the special case t = a. This C-labeling is 
sound since only redex positions are marked, but it is not complete; not all redex positions 
are marked. Nevertheless, this labeling can be used to prove outermost ground termination 
of i?5. The dynamic labeling /^R^ of R^ consists of: 

g"'^(a, x) ^ g^'^(g"'"(a, a), x) g"'«(x, x) ^ b 

g^''^(x, a) ^ g^'^(x, g«'"(a, a)) g^'^(x, x) ^ b (A^R,) 

top^(a)^top^(g«''^(a,a)) 

The employed C-labeling is not complete, and so the ;uTRS A'^i?5 admits rewrite sequences 
(starting from correctly labeled terms) that do not correspond to outermost rewriting, e.g: 

top^(g^'^(g'^'''(a, a), g»'"(a, a))) ^a^^^,/. top^(g^'^(b, g"'"(a, a))) 

^A-i?5,Mtop^(g^'^(b,b)) 

Despite of this, the ^TRS can be shown to be terminating, and since the C-labeling was 
sound, we conclude outermost ground termination of R^ by Theorem 15. 8[ 

Example 5.11. In Examples 15.91 and 15. 101 we have seen how our method can be applied to 
prove outermost termination of non-quasi-left-linear TRSs. We now consider an example 
which shows that not every non-left-linear TRS can be handled by our method: 

f(x)^g(f(x),f(x)) g(x,x)^b (Re) 

This TRS is outermost terminating. Now the trick used in Example 15.101 does not work. 
In order to construct a terminating /iTRS A^Rq we need to forbid rewriting in all terms of 
the form g(f(t),f(t)). This is impossible using a finite algebra. 
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We need the following adaptation of |Olil02[ Proposition 5.5.24] for /^TRSs; the proof 
proceeds along the same lines. 

Theorem 5.12. Let {R, fJ-) be a terminating many-sorted fiTRS. If the fiTRS obtained from 
{R, fi) by dropping sorts admits an infinite rewrite sequence, then {R, fi) is collapsing and 
duplicating. □ 

While for soundness of the transformation (Theorem 15. 8p a sound labeling suffices, for 
a complete transformation we need the C-labeling to be complete, maximal and core: 

Theorem 5.13. Let R be a TRS over S, and {A,7r,T, ) a complete, maximal, and core 
C-labeling for R. Then J^R is terminating if R is outermost ground terminating. 

Proof. Assume that A^R is not terminating. We turn A^R into a sorted TRS. The sorts 
are chosen from the set ^U {top}. Since the C-labeling is maximal, for each n-ary symbol 
f^ € /a6(Stop) we have A = (ai,...,an). We let f^ have input sort A and output sort 
|{f|(ai, . . . , On). The only exception is the output sort of the symbols top'*' which we fix to 
be the sort top. Then by Theorem 15. 121 together with non-collapsingness of A^R yields the 
existence of a well-sorted infinite rewrite sequence r in A^R. Since the C-labeling is core, 
by Lemma 13.41 there exists a ground term for every sort in A. Thus by applying a ground 
substitution to r we obtain a well-sorted infinite ground term rewrite sequence r'. 

Well-sortedness implies correct labeling: for each well-sorted term t € T(/a6(5]top)> 0) 
there exists a term t' € T(Stop, 0) such that t = lab{t'). Moreover, a symbol top can only 
occur at the top of a term. Without loss of generality we assume that every term in r' 
has top (for some A G ^) as root (as rewriting below top is allowed and context-sensitive 
rewriting is closed under /i-replacing contexts). Hence it suffices to show that for all terms 
s,i G T(S,0) with lab{top{s)) -^A^R,fj. lab{top{t)) we have s ^r t. By construction, each 
rule in A^R is the result of prepending contexts to, and labeling of, a rule in R. Let 
p : s -^R t be the step corresponding to /a6(top(s)) -^A^B.,fi lab{top{t)). We show that p 
is an outermost step. Assume there would be a redex u above the rewrite position. Then 
by completeness of the C-labeling we get root{lab{u)) € S . But then this symbol must 
be in /a6(top(s)), either above the applied rule from /^ R or within the prepended context. 
Both cases yield a contradiction: the former since p{root{lab{u))) = would prohibit the 
/x-step, and the latter because we do not prepend symbols from S . □ 



Let us consider the three conditions of Theorem l5.13l on C-labelings: complete, maximal 
and core. To see that completeness and maximality are necessary, we refer to Examples 1 5. IH 
and 19.41 respectively. The following example shows the need to restrict to core algebras: 

Example 5.14. Let Rj be the following term rewriting system: 

f(a;) ^g(x,f(x)) g(a,2;)^a g(f(2;),y)^a g{g{x,y),z) ^ a (Rj) 

This TRS is outermost ground terminating: First note that without the first rule Ri is 
terminating. So consider a rewrite step f(t) — t- g(t,f(f)) for t € T({f, g, a}, 0). Then one of 
the three g-rules matches g(t,f(t)) and blocks all inner rules by the outermost strategy. 

We take the C-model Ay = {0, 1} with [a] = [[f](x) = [[gKa;,y) = 0, for all x,y e A7. 
We let TT be the maximal labeling and define S^'^'^ = {f'',g'^''^}. Then the dynamic context 
extension A^Rj contains, amongst others, the following two rules: 

fO(x) ^gO'0(x,fO(x)) f^x) ^gi'°(x,fi(x)) 
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where ^(g'^''') = and /i(g^''^) = {1,2}. Consequently, the second rule is not terminating, 
although the original TRS is outermost ground terminating. The C-labeling (^7,7r,S ) 
is complete for R and maximal, but not core. Note that there exists no ground term which 
has the interpretation 1, and hence the label 1 should never occur. 

Theorems 15.81 and 15.131 are about outermost ground termination. This is not a severe 
restriction, as by adding a fresh constant and a fresh unary symbol s outermost ground 
termination implies (and for quasi-left-linear TRSs coincides with) outermost termination: 

Lemma 5.15. A TRS R overT, is outermost terminating if R over T,U{s,0} is outermost 
ground terminating. If R is also quasi-left-linear, the converse direction holds as well. 

Proof. Let X be countably infinite, and cp : X ^!- 'N a bijection. We define a substitution a 
by (t{x) = s'^^'^'{0). Then, we have scr ^ ta whenever s ^ t with s,t £ T(S, A"), since the 
symbols s and do not occur in any pattern of a rule, and for all p,q € Pos{s) we have 
s\p = s\q <^ sa\p = sa\g. This concludes the proof of the first part of the theorem. 

For the converse direction, let i? be a quasi-left-linear TRS such that R over S U {s, 0} 
is not outermost ground terminating. Let t € T(S U {s, 0}, 0) be a ground term of minimal 
size admitting an infinite rewrite sequence t = ti ^ t2 ^ t^ ^ . . .. By minimality, infinitely 
many of these steps must be in the prefix of t not containing s and 0. Let t' S T{T,,X) 
be obtained from t by replacing all subterms with root symbol s or in i by a (arbitrary, 
but fixed) variable x. Then t' admits an infinite outermost rewrite sequence as well. Note 
that by replacing the subterms no redex in the {s, 0}-free prefix of t is destroyed since the 
symbols s and do not occur any rule pattern. Fresh redexes with respect to non-left-linear 
rules may be created (but not with respect to left-linear rules). By quasi- left-linearity, at 
each position where a redex is created, there is also redex with respect to a left-linear rule. 
Hence, no additional redexes get blocked by outermost strategy. D 

The following example shows that extending the signature with a single fresh constant 
only is not enough for the implication: R over the extended signature is outermost ground 
terminating =^ R is outermost terminating. 

Example 5.16. Consider the following term rewriting system R: 

f (x, y) -^ a(f(x, y)) a(f(b, x)) -^ b a(f (x, b)) -^ h 

a(f(a;,x))^b a(f(a(x),y)) ^ b a(f(x, a(y))) ^ b 

a(f(f(x,y),z))^b a(f(x,f(y,z)))^b 

Because of the first rule R is not outermost terminating: 

fix,y)"-4aif{x,y))"4s{3if{x,y)))"-4... 

but the TRS over the extended signature S' = {a,f, b, 0} is outermost ground terminating: 
Consider a step f(s,t) — )• a(f(s,t)) with s,t € T{T,',0). If s ^ one of the rules in the 
second column applies, and if t 7^ then one of the rules in the third column is applicable. 
However if s = t = then the rule a(f(x, x)) -^ h matches. 

Note that for the second part of Lemma 15.151 we require quasi-left-linearity. This 
requirement was erroneously missing from |EH09| , but is necessary as the following example 
illustrates. 
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Example 5.17. We consider the following term rewriting system R: 

f (a;, y, y) -^ a(f (x, X, y)) a(f (x, a(y), 3(2;))) -^ _L 

b^a(b) a(f(x,b,b)) ^± 

a(b) ^± a(f(x,f(yi,2/2,2/3),f(^i,22,23))) ^ -L 

a(f(x, X, x)) -^ ± a(f (x, ±, 1)) -^ ± 

and explain why this system is outermost terminating. Without the rule f (x, y, y) — > 
a(f(x,x,y)) outermost termination of R is obvious; Hence, in an infinite rewrite sequence 
this rule must be applied infinitely often. Let us consider a rewrite step C[f(t, w, w)] ^ 
C[a(f(t, t,n))]. If t G <Y, then u = t since no non-variable term rewrites to a variable; 
then a(f(x,x,x)) — )• ± is applicable and has priority (by outermost strategy) over all inner 
rewrite steps (and we terminate). If t ^ X, then the second argument u and third argument 
t in a(f(t,t, n)) have to rewrite to a common non-variable reduct (in order to make the 
first rule applicable again). However, as soon as the common reduct is reached, one of the 
rules displayed on the right would be applicable and have priority by outermost rewriting 
strategy. 

Nevertheless, R over the signature S U {s, 0} is not outermost ground terminating: 

a(f(s(b),s(a(b)),s(a(b)))) "-4 a(a(f(s(b), s(b), s(a(b))))) 

^a(a(f(s(b),s(a(b)),s(a(b))))) 

6. Dynamic Labeling 

This paper is about employing context-sensitive rewriting to model outermost rewriting. 
We do so by marking redexes, and forbid rewriting below them. As we have seen, contracting 
a redex may create another redex higher up in the term tree. Hence it may be necessary 
to update some labels during a rewrite step. Li Section [5] we defined a transformation 
where this updating was accounted for by extending rules with contexts. Here we give an 
alternative transformation from TRSs to context-sensitive TRSs. We call this tranformation 
'dynamic labeling'. Instead of extending rules with contexts, we now employ rewriting to 
propagate the changed information upward in the term tree, and set the labels in the 
surrounding context right, step by step. Again the C-depth (Definition 13. 8p serves as a 
bound: here on the number of successive ancestor nodes that have to be relabeled. Each 
original rewrite step will give rise to a corresponding step and a bounded number (< the 
C-depth) of auxiliary steps in the transformed system. Thus, although the derivational 
complexity (the length of rewrite sequences) is changed, this is only by a constant factor. 
We prove that dynamic labeling is sound for arbitrary TRSs. Moreover, for left-linear 
TRSs, the method is complete in a weakened sense, see Theorem 16.121 In Section \T0\ we 
compare the performance of this method to the one of dynamic context extension described 
in Section [5l 

We begin with an analysis for evaluating which value changes can occur by rewriting 
and need to be propagated upward. As we will see, this restricts the number of auxiliary 
'relabel symbols', and, in particular, the number of 'relabeling rules'. 
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Definition 6.1. Let R he a TRS over S, and let {A,7r,J:'''"^) be a C-labeling for R. For 

i = 0,l,2,...,we define the set Lj C ^ x ^ inductively by: 

Lo = { {ll,al lr,a})\i^rGR,a: Var{£) ^ A, ll,a} ^ |r, aj } 
L,+i = LiU{(Ifl(a,6,c),[fl(a,6',c)) |f gS, a -b-c G ^«^ {b,b') e U, 

Then we define the set L'^{R) of value-change pairs by: 

L-(i?) = L,\{(a,a)},e^ 
with i the least number such that Lj+i = Lj. 

The 'dynamic labeling' f'^i? of a TRS i? is partitioned into two sets of rules. The 
first set is denoted by ti-R and consists of a semantic labeling of the original rules, where, 
additionally, a right-hand side is prefixed by a symbol relabel"''^ whenever application of 
the rule causes a change of interpretation from a to a' . The second set, ^2-^1 i^ ^ ^^^ °f rules 
for relabeling the context of the rule application. A symbol relabel'^'" , with (a, a') G L'^{R), 
indicates that the value of its subterm has changed from a to a' , and the rules in ^2-^ take 
care of propagating this change of value upward in the term. 

Definition 6.2 (Dynamic labeling). Let i? be a TRS over S, and let (.4, vr, S'"'^'^) be a 
C-labeling for R. The TRS t''^ over the signature lab{T.top) U {relabel"'"' | {a, a') G L'^iR)} 
is defined by t'^i? = ti -R U ^2^- Here the set "[JR of labeled rules contains, for each rule 
i —^ r ^ R and assignment a : Var{i) -^ A, one of the rules: 

labd a) ^ I ^"'^^^^ "^ ^^ ^^' "^ " ^^' "^ 

^ ^ '"^ |relabe|I^'"l'I^'"J(Za6(r,a)) otherwise 

Secondly, the set ^2-^ of relabeling rules contains, for each n-ary f G S, (6, b') G L'^{R), and 
{a, b, c) G A"' such that f'^ G /a5(Stop) \ S'"'^'^ with A = vTf (a , 6, c ), one of the rules: 

rA/^ , , ,hh' , ■. ^N I f''* (25*,?/,^) if (i = d' 

f\£,relabel^'^(y),i')^<^ iui^>/.AV- -^^ ,v. • 

I relabel' (f (x , y, 2 )) otherwise 

where A' = vTf (a , 6', c), d = [[f](a , 6, c), d' = |f]](a , b' , c), |x | = \a\, and |z | = |c |. 

The dynamic labeling of R (with respect to the C-labeling {A,tt,T,'^^'^)) is the context- 
sensitive TRS {f^R, //), where the replacement map /x is defined by /^(relabel"'" ) = for ah 
(a, a') G L^(i?), ^(f) = if f G S'"'^'^, and ^(f) = {!,..., tjf} otherwise, for aU f G /a6(Stop)- 
Whenever T,^'^'^ is clear from the context, we leave fi implicit, and overload the notation \'^R 
to denote {YR,n). 

Example 6.3. We revisit the TRS Ri from Example 13. 91 for which we worked out the static 
and dynamic context extensions in Examples 14.21 and 15.11 We repeat its definition and the 
C-labeling from Example 13.121 Ri is the TRS over S = {a,f, g} consisting of the rules: 

f(g(x)) ^ f(f(g(x))) m^{x))) ^ X (Ri) 

A C-model for Ri is formed by the S-algebra Ai = {J-, f,ff,g} with interpretation: 

ici = ± m{±) = m(g) = f m{f) = iq(ff)=ff M{x) = g mo 

for all X G Ai- Furthermore, (^i,7r) denotes the maximal labeling for Ri, and S = 
|f9 fif|_ Then {Ai,7r,T, ) forms a sound and complete C-labeling of Ri. Also note that 
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Ai forms a core algebra; for each value e G Ai there is a ground term t such that [ij = e. 
We first compute the set L'^{Ri) of value-change pairs. For the initial set Lq, note that the 
rule f(g(x)) — )■ f(f(g(2;))) changes the interpretation from / to ff , regardless of the value 
assigned to x. The other rule creates three value-change pairs; one for each of the values 
g, _L, / assigned to x. If the interpretation of x is j(f there is no change. Hence we get: 

^0 = {(/,#>, (#,^), (#,/), (jO^,5)} 

All symbols relabel^'*^ with (e,e') € Lq will disappear in one relabeling step, whence 
L^'iRi) = Lq. The dynamic labeling of Ri then is t''^! = ti^i U t2^i where ti^i 
consists of the rules: 

f9(g^(x)) ^ relabel^'^(f-^(f5(g"(x)))) for aU e G Ai 

^S{^S{^<^'{x))) ^ relaber'^'(x) for ah (e, e') e {(#, ±), (#,/), (#,5)} 

^ff {^ff [W [x))) ^ X 

and where t2-^i ^^ formed by: 

g<=(relaber'^'(x)) ^ g^'(x) for all (e,e') G L^(i?i) 

f-^(relabel-^'-^(x)) ^f-S'(x) 

In total the dynamic labeling of Ri has 13 rules. Had we not restricted the construction 
of the set of the relabeling rules to the 'reachable' symbols relabel'^''^ (by the requirement 
(e, e') G L'^{Ri) in Definition 16. 2p . we would have come up with 18 instead of 5 relabeling 
rules. 

Example 6.4. We reconsider the term rewrite system R^ from Example 15.51 

f(h(x),c) ^ f(i(x),s(2;)) \{x) ^ h{x) 

f{\{x),y) ^ X h(x) -)■ f(h(x),c) 

and the algebra ^3 = ({_L, c, h, i}, [[•]]) with [•]] defined, for all x,y £ A3, as follows: 

M = c [h] (x) = h |il (x) = z m (x, y) = H (x) = ± 

Moreover, we employ minimal labeling again; see Example 15.51 
The set of change- value pairs is: 

L^Rs) = {{±,c),{±,h),{±,i),{i,h),{h,±)} 

The set ti-^s of labeled rules is constructed thus: 

f*(h*(x),c)^f*(i*(x),s(x)) 

f*(i*(x),y) ^x 

f*(i^(x),y)^relaber'^'(x) {e,e') e {{±,c),{±,h), {±,i)} 

\*{x) -^ relabeP''^(h'^(x)) 

h*(x)^relabel'^'^(f*(h*(x),c)) 

There are four rules with left-hand side i = f*(i*(x),y), one for each value assigned to x. 
In case a(x) = _L there is no change of interpretation, for we have that p, a] = _L for all 
a : {x,y} — )• .A3 and so no relabel symbol is inserted. But if, for instance, [[(7(x)] = c for 
some substitution a, then some labels in the context C of a rewrite step C[ia] -^ C[(t{x)] 
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have to be updated, since the value of □ has changed from _L to c, whence the insertion of 
relabel"'"'^ to the right-hand side x. 

The set t2-^3 of relabehng rules is formed by: 

{e,e')e{{±,c),{±,h),{h,±)} 

{e,e')e{{±,h),{±,i)} 

(e, e') G L^Rs) 

(e,e')eL-(i?3) 

{e,e')eL^R3) 

Some remarks for clarification: First, note that all relabel symbols disappear upon one 
relabeling step. Secondly, observe the overlap in, for example, the rules with left-hand side 
f(relabel ' {x),y). If the value assigned to y is c, then a redex is created; this is witnessed 
by the marked symbol f* on the right. For other values for y, this is not the case. Also 
note that there is no rule for t = f(relabeP' {x),y). This is because when the left argument 
of f is interpreted as i, then t forms a redex, and so f should be marked. Definition 16.21 
does not allow relabel symbols to commute with redex symbols. Intuitively, a relabel symbol 
is a witness of a rewrite step which we do not want to occur inside other redexes, as we 
want to model outermost terminination. However, more technically, sometimes illegal (i.e., 
non-outermost) relabel steps are allowed. This is illustrated in Example 16.111 The point is 
that by preventing relabel symbols to commute with redex symbols, for local completeness 
(Theorem I6.12p it is as if illegal steps never happened. 

Remark 6.5. We elaborate on the role of the element a in relabel"''^ . Whenever the 
application of a rule C[£a] — )• C[ra] changes the interpretation, i.e., fia} / i^aj, then 
a symbol relabel'^^'^^'I'^'^l' is inserted. A term of the form relabel*^'" (t') can be thought of 
as a witness of a rewrite step t ^ t' causing a change of interpretation from a = [t] to 
a' = |t'] . This change of the value then needs to be propagated upward to update the labels 
accordingly, using the relabeling rules from t2^- ^^ ^^^^ sight, the value a in relabel'*'" (t) 
seems redundant for relabeling: why would we store the previous value? However, the label 
a is important in order to restrict the number of applicable rules, and to have a bound on 
the number of relabeling steps. To see this, consider the system Rq with single rewrite rule: 

f(g(f(x))) ^ d (Rs) 

and the algebra As = {±, /, gf} with [fKx) = / for ah x G As, Mif) = gf, M{x) = ± for 
all X ^ f, and [dj = _L. We employ minimal labeling, that is, only iTf{gf) = * , and all the 
other symbols are unlabeled. 

The dynamic labeling t'^-Rs gives rise to two labelings of the original rule: 

f*(g(f(x))) -^ relabel-^'^(d) (6.1) 

f*(g(f*(x))) ^ relabel-^'^(d) (6.2) 

And, among the fourteen rules in t'^-Rs for updating labels, we find the following two: 

g(relabel^'^(x)) -^ relabel»-^'^(g(x)) (6.3) 

g(relabel^-^'^(x)) -^ g(x) (6.4) 
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Now consider the term t = top(g(- • • (g(g(g(f*(g(f(d))))))))), and the rewrite sequence: 

* ^EHm top(g(- • • (g(g(g(relabel^'^(d))))))) 

^E31m top(g(- • • (g(g(relabel^/'^(g(d))))))) 

^1631/. top(g(- • • (g(g(g(d)))))) 

After an application of ()6.ip . relabehng takes two steps, resulting in a correctly labeled 
term. 

In the alternative, let us say 'forgetful' version of dynamic labeling, where the 'from' 
value a in symbols relabel'^' is omitted, the rules (J6.ip ~ (j6.4p look like this: 

f*(g(f(x))) ^ relabel^(d) dH]) 

f*(g(f*(2;))) ^ relabel^(d) dO) 

g(relabel^(x)) -> relabe|-L(g(x)) (jOT ) 

g(relabel^(x)) ^g(x) ^6J\ ) 

Due to the overlap in rules (|6.3ip and (j6.4ip . the resulting /iTRS has a rewrite sequence 



from t where the symbol relabel goes up all the way to the top: 

t -mn>^ top(g(. • • (g(g(g(relabel^(d))))))) 

^lEll;.top(g(---(g(g(relabel^(g(d))))))) 

^(ODMtop(g(---(g(relabel^(g(g(d))))))) 

"^ICTlu ■ ■ ■ 

From the following lemma it follows that every relabel symbol can be rewritten at most 
S_a{R) times (before it vanishes). By rewriting a 'relabel symbol' we refer to a notion of 
residuals that extends the usual definition of orthogonal projection [Ter03] with a concept 
suggested by the definition of t2 -^ • Whenever we have a rule of the form: 

f^(£, relabel'"' (/a5(i)), -2*) ^ relabel'''^' (f^' (x, Za6(t),z)) 

then we call relabel ' in the right-hand side a residual of relabel"'" in the left-hand side. 

Lemma 6.6. Let R be a TRS over S, and let {A, vr, Tj ) be a C -labeling for R. We define 
the relation -^ C L^'iR) x L''(i?) by: 

(6,6')-^(IfKa,6,c),IfKa,6',c)) for every f € S, (6,6') eL'^iR), a-b-c € A^^ , 

Then -^ is well-founded and every '^ path has length < S_4^{R). 

Proof. By definition of value-change pairs we have that for every pair (6, 6') G L'^{R) there 
exists a rule i ^ r G R and assignment a : Var{£) — > A such that {{£, a}, [r, q]) -->* (6, 6'). 
Assume, to arrive at a contradiction, there exists a sequence 

Hi, al [r, aj) = (6o, 6'o) ^ (6i, 6;) ^ . . . ^ (6„, b'J 

with m > Sj\^{R). For i = 0,1, ... ,m we construct thin contexts Di and assignments 
ai : Var{Di) -^ A such that 6^ = [[A[^],ai]] and 6^ = [[A[?-],ail- We begin with Dq = D 
and ao = a. Then we have 6o = li, a] and 6q = [r, a}. For i = 1, . . . ,m there exist fj € S, 
and dl ■ 6j_i • c7 G A^^' such that 6j = |fj(a7 , 6j_i, c7) and 6- = {Ujidi ,b^_-^^,c[). We pick 
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fresh variables xl and zl with |a?i | = |di | and \zi \ = |c7 1, and define Di = ii{xi , -Dj-i, il ), 
and Ui is aj_i extended by mapping variables ocl to the corresponding dl and zl to q. It 
follows that bi = lDi[i],aij and b'^ = lDi[r],aij. But then [£>[£], a] = bm ^ b'^ = lD[r],aj 
which contradicts that 6j[{i — > r) is the C-depth of ^ — > r. □ 

Corollary 6.7. Every relabel symbol disappears at latest after having applied Sj[{R) many 
relabeling rules (to this symbol). 

Proof. For every rule in t2-^ o^ the form: 

f^(x, relabel"-'^' (/a6(t)),£) -^ relabel^'''' (f^' (£, /a6(t), z)) 

we have that (a, a') -^ (6, 6'). D 

For the dynamic context extension, the 'intended' terms in 'T{lab{T,), 0) are those terms 
that can be obtained by correctly labeling terms in T(S,0). For the purpose of adapting 
this definition to dynamic labeling, we enrich the (unlabeled) signature S to S+: 

S+ = S U {relabel'' | (a, a') G L^(-R)} 
and extend the C-labeling to S-(_ by: 

^reiabei^&O = ft' [relabel^l = 6 

for all b,b' (^ A such that (6, c) € L'^{R) for some c G ^. Then labeled symbols are identified 
by (relabel")'^' = relabel"'"'. 

We obtain the following lemma: 

Lemma 6.8. Let R be a TRS over S, and let {A, vr, S ) be a C-labeling for R. Whenever 
we have a ground term s of the form: 

s = lab{C[^{si, ..., relaber(t), . . . , s„)]) 

with {a, a') £ L^{R), a' = Jt], and where the displayed relabel symbol is at a ^-replacing 
position, then one of the following steps applies: 

s ^tIi?,M lab{C[re\^be\\^{si, ...,t,..., s„))]) (6.5) 

s ^tlfi.M lab{C[f{si, ...,t,..., s„)]) (6.6) 

where b = [[f(si, ...,□,..., s„), □ i-^ aj. 

Proof Let b' = [[f(si, . . . , t, . . . , s^)]. Note that b' = [f(si, . . . , D, . . . , s„), □ ^ a'j. Then: 

lab{f{si, ..., relaber(t), . . . , s„)) = f^{lab{si), ..., relabel"-"' (/a6(t)), . . . , lab{sn)) 

k6(relabel''(f(si, . . . , t, . . . , s„))) = relabel'''''' (f^'(Za6(si), . . . , lab{t), ..., /a6(s„))) 

where A = 7rf([[si]l, • • • , a, • • • , Isnj) and A' = vrf(|si]], . . . , a', . . . , l[s„]]). 

By Definition 16.21 the dynamic labeling 'I'^R contains a rule of the form: 

f^(£, relabel"'"' (/a6(t)),i') ^ C[f^' {x ,lab{t),z)] 
with C = □ or C = relabel ' (□). Consequently we have a step of the form: 

lab{f{si, ..., relabel"(t), . . . , s„)) ^t"fi,M lab{D[f{si, . . . , t, . . . , s„)]) 

with D = n or D = relabel''(n). 

Now the claim follows since s = lab{C, □ i-^ b)[lab{f{si, . . . , relabel"(t), . . . , s„))]. D 
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Lemma 6.9. Let R be a TRS over S, and let {A, vr, S ) be a sound C -labeling for R. Let 
s,t € T(5], 0) be ground terms such that s ^/j t. Then, for some m < Sj[{R) : 

Proof. Assume s ^R,p t for some position p € Pos{s). Then there exists a rule i ^ r £ R, 
a context C with root{C\p) = □ and a ground substitution a : X —>■ T(S, 0) such that 
s = C[ia] and t = C[ra]. Let C^ = /a6(top(C), □ i— )• a) and a = lab{(T), then by Lemma [3.61 
we obtain: 

lab{top{s)) = lab{top{C[ea])) = C^,^^ [lab{ea)] = q.^j [lab{e, M)a] (6.7) 

and, hkewise: 

labitopit)) = Cfra}[lab{r, laj)a] (6.8) 

By definition of dynamic labehng, one of the following rules is in ti -^^ 

lab{i, la}) ^ lab{r, {a}) if {la} = {raj (6.9) 

lab{i, laj) -^ relabe|I^'^J'I'''^J(k6(r, [ctD) if {laj ^ {raj (6.10) 

(Note that {£, {ajj = {iaj, and |r, {a}} = {raj by Lemma O) 

In case lia} = Ira}, no relabeling is needed and we take m = 0: 



lab{top{s)) = Cjfoi [lab{£, la}) a] ^tJ^.M C'lH [l(ib{r, la}) a] "^^ /a6(top(t)) 
Ifl^al/M,weget: 

labitopis))^C^,,^[labii,la})a] 

->";. Ci,^^[re\Bbe\^'^Ur^Hlab{r, la}))a] = lab{top{C[re\ahe\^''^Hra)])) 

By Lemma 16.81 the relabel symbol can 'walk' upward until it disappears, and at the latest 
it vanishes when it meets top. Hence we have: 

lab{top{s)) ^tr/?,M /aKtop(C[relabe|I^'^I(m)])) ^!f|^,^ lab{top{C[ra])) 

for some m < Sji{R) by Lemma [6.61 □ 

Theorem 6.10. Let R be a TRS over E, and {A, it, S ) a sound C-labeling for R. Then 
R is outermost ground terminating if^^R is terminating. 

Proof. Assume that R admits an infinite outermost rewrite sequence of ground terms: 

, O'id , oid , out 

tl ->R 12 ->R E3 ^R ■ ■ ■ 

Then from Lemma 16.91 it follows that "[^ R admits an infinite rewrite sequence: 

lab{top{ti)) -^pR^^ lab{top{t2)) -^pR^^ labitop^ts)) ^pR^^, ■■■ D 



We note that Theorem 16.101 can be strengthened by weakening the termination of ^^R 
to local termination of '['^R on the set of correctly labeled ground terms without relabel 
symbols. Let us denote this set by lab{T{Ti, 0)): 

lab{T{T., 0)) = {lab{t) \ t G r(S, 0)} 

Theorem 16.121 below states that dynamic labeling is complete with respect to local 
termination on /a6(T(S,0)). More precisely, outermost ground termination of R implies 
termination of '['^ R on lab{TiTi, 0)). The following example helps to understand the proof 
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of that theorem; it illustrates that even when starting from terms in /a6(T(S, 0)), not every 
rewrite step in ']''^R corresponds to an outermost step in R. 

Example 6.11. Let R consist of the following rules: 

f(b,x) -> a f(2;,b) -> a 

f(b,b) ^f(a,a) a^b 

Moreover, let A = {-L, b} with |a]] = ±, |b]] = b, and m{x, y) = 1. for ah x,y G A. Labeling 
symbols with the value of their arguments, we obtain for ti ^^ 

f^'^(b,x)^a f^'''(x,b)^a 

f^'^(b,2;)^a f'''''(x, b) ^ a 

f'''^(b, b) -^ f^'^(a, a) a -^ relabel^'*(b) 

and for '['^R: 

f^'^(relabel^'^(x), 2/) -^ f'''^(x, y) f^'^(x, re\abe\^^''{y)) -^ f^'^(x, y) 

where S'"'^'^ = {a,f^'-'-,f-'-''',f*'*}. Then we obtain the following rewrite sequence in "['^R: 

k6(f(a,a)) = f^'^(a,a) 

^tri?,Mf^'^(''elabel^'''(b),a) 

^trfl,M f^'^(relabel^'''(b), relabel^'''(b)) 

^tJfl,Mf''^(b,relabel^'^(b)) 

The second step in this rewrite sequence does not correspond to an outermost step. Nev- 
ertheless, Theorem 16.121 states that such 'illegal' steps do not harm completeness of the 
transformation. The reason is that if the relabeling rules create a redex above some relabel 
symbol, then this relabel symbol is prevented from further propagating its information up- 
ward (until it becomes //-replacing again). The crucial point is that above relabel symbols 
the labels are unchanged, thus as if the step would not have taken place. Moreover, it is 
essential that t'^i? prohibits relabel to propagate over symbols from S . For instance, in 
the above example ^'"R does not contain a rule of the form: 

&^{x, relabel^'^(y)) ^ f^'^{x, y) (6.11) 

This rule would cause non-termination: 

f^'^(a, a) ^^2 f^-^(relabel^'''(b), relabel^'*(b)) 

^^f'''^(b,relabel^'^(b)) 

^f^'''(b,b) 
^^f^'^(a,a) 

Theorem 6.12. Let R be a left-linear TRS over S, and (^, tt, S'"'^'^) a complete, maximal, 
and core C -labeling for R. Then '\'^R is terminating on the set of terms lab{T{T,, 0)) if R 
is outermost ground terminating. 

Proof Define T = lab{T{T,, 0)), and ^ = (^tr^.M ' ^r^.M^ ^ (^ ^ ^)- ^0*^ ^^^^ *^e rela- 
tion ^^ is restricted to terms which contain no relabel symbols. Hence always the maximal 
number of relabeling rules is applied. It is clear that each ^-7> rewrite step corresponds to 



) — )• lab[r, a) 


(clj 


)^relabe|I^'°l'I^'"I(Za6(r,a)) 


(c2) 


)^fMa,b',c)^^^y^^-) 


(c3) 


) ^ relabel'^''^'(Pf('^^'^''^^)(x,y,z)) 


(c4) 


■fR./i ti+i ^^d construct sq "^^^ si ^-> . 


. . M- Sj in 
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an outermost rewrite step in the original TRS R. Therefore it suffices to show that any in- 
finite rewrite sequence t = to — ^t'"^,/^ *i ~^VR,f^ ■ ■ ■ gives rise to an infinite rewrite sequence 
t = So ^^ si ^^ . . .. We prove the claim by a kind of standardization of reductions. We first 
classify the rules from ^'^R: 

lab{i, a) 
lab{i, a) 

For i = 0, 1, . . ., we analyse the steps U 

such a way that Sj ~^^H^2lr4l**+i '^'^^'^^ ^^ ^^^ ~~ ^c2lc4l ^Q denote standard term rewriting 
ignoring the replacement map /i, and using rules from ([c2]) and ([c^ only. Observe that 
then the maximal prefix Cj+i of tj+i not containing relabel symbols is also a prefix of Sj 
(since everything changed by ([c2]) and (jcl|) is 'hidden' inside a relabel symbol). We begin 
with t = So, and i = j = 0. For i = 0, 1, . . ., we consider the step Tj : ti -^^^r^^ U+i- 
If Tj is a step with respect to a rule from: 

— (ic2]) or (fc4|l . then we append tj to the rewrite sequence Sj ~'^> | c2 l c4 l^^ yielding Sj ~'^> | c2 l c4 l 
tj+i. Note that this leaves the "^^-rewrite sequence sq ^^* Sj untouched. 

— ([cTj) , then the pattern of Tj lies entirely in Q which is also prefix of Sj . Then we append Tj 
to Sq '—>■* Sj (using left-linearity of R) yielding sq ^-s-* Sj ^ -Sj+i- We have Sj+i "'^^Uo l M l 
tj+i by orthogonal projection of the steps Sj ~'^>U2 l r4 l ** °^^^ ^^ ^ ■*.?+! (^^^ steps in 
Sj ~'^) ] *2 | ^4 | ti are below the prefix Cj). 

— ([c3|) . then a relabel symbol 'disappears'. We can trace this symbol back to a sequence 
of steps (Tj : Sj — ^^^[^- "'^^ i l'^ i s', that is, it must have been created in Sj by a ([c2]) 
step, followed by a number of ([cl]) steps. We combine ai and tj to a ^4- step, yielding 
So ^^* Sj M-'sj+i. Then Sj+i "'^^Uo l M l ^^+^ ^^ ^'^^ remaining steps from Sj ~'^>U2 l r4 l ^^ 
are not harmed by the permutation (performing Uj first). 

It remains to be shown that the constructed sequence sq M- si "^ S2 '^ . . . is infinite. This 
follows from the fact that an infinite number of steps in to — ^-ffi,^ ti -^f^R^^ ■ ■ ■ must be of 
type (icTI) or (fc3|l . This is a direct consequence of the fact that — ) |g2|c4| is terminating (with 
every step the prefix in which rewriting is allowed gets smaller). □ 

The following example demonstrates why the completeness result for dynamic labeling 
(Theorem I6.12p is restricted to the set /a&(7~(S,0)) of correctly labeled terms which do 
not contain relabel symbols. The point is that, although the original TRS is outermost 
terminating the transformed system may in general be non-terminating due to the existence 
of 'non-reachable' terms. 

Example 6.13. We consider the following term rewriting system R: 

a ^ b f(b, y) ^ b f (c, y) -^ h(f(y, y)) 

h(f(x,b))^b h(f(x,c))^b 

We explain why this TRS is outermost ground terminating. Without the rule p : f (c, y) — )• 
h(f(y,y)), the system would even be terminating. Now note that the rule p can only be 
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applied once to each occurrence of f(c, □) since h(f(t,t)) — >* h(f(c, t')) implies that i = c, 
and then the rule h(f(x,c)) — > b has priority by the strategy of outermost rewriting. 

We define a maximal, complete, core C-labeling (^, 7r,E ) for R (isomorphic to the 
result of the construction given in the next section) where the algebra A = {bc,fbc, _L} with 
the interpretation function defined by: 

lfjix,bc)=fbc lbj = 14 = be 

m{x,y) = ± [a] = Ih](x) = ± 

for all x,y (£A with y / be, and with W""^ = {a, f^^ h^*'^}. 

The dynamic labeling '['^R of R with respect to this C-labeling then includes the rules: 

a ^ relabel^'^'=(b) 

f*'^'^(c,y)^h^(f^'^(y,y)) 

f^'^(relabel^'*'^(x), y) -^ f^^'^(x, y) 

Now the context-sensitive TRS t'^i? admits the following infinite rewrite sequence: 

f^^'^(c, relabel^''"=(c)) ^rR,,^ h^(f^'^(relabel^'^=(c), relabel^'^^(c))) 

^r/?,Mh^(f''''^(c,relabel^'^^(c))) 

~^VR,fj' ■ ■ ■ 

Observe that this anomaly is caused by the subterm relabel ' "^(c), which is not reachable 
from any term in lab{T{T,, 0)). 

Remark 6.14. Theorem 16. 121 states completeness of dynamic labeling with respect to local 
termination on the set of terms lab{T{Ti, 0)). We briefly indicate how the theorem can be 
generalized to termination on T{labiTi),0) by altering the definition of '\'" R. Note that 
/a6(7'(S,0)) CI T(Za6(E),0). In particular, the set T(^a&(S),0) includes terms that are 
not correctly labeled. The necessary modification of the definition of dynamic labeling 
concerns the elimination of collapsing rules £ — >• x. This can be achieved by wrapping the 
right-hand side into relabel'^'"(n) even when the interpretations of the left and right-hand 
side are equal. Additionaly, we let the symbols relabel"'*^ disappear after one relabeling 
step. By an application of Theorem 15.121 it then follows that termination on /a6(T(S,0)) 
coincides with termination on T{lab{T,), 0). 

7. Constructing Suitable Algebras 

We construct C-models which are able to recognize redex positions with respect to left- 
linear rules. The construction of C-models is similar to the construction of a deterministic 
tree automaton (DTA, [CDG"'"07] ) for recognizing left-linear redexes [ComOO] . A DTA is a 
S-algebra {A, [•]]) with a distinguished set Ap C A of final states. A term t is accepted by 
the automaton whenever [tj € Ap. A difference with the construction of a DTA is that for 
the construction of a C-model we do not distinguish final and non-final states, but instead 
have a family of functions isRedexf : A* — )• Bool for indicating the presence of a redex. 
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Definition 7.1 (Redex-algebra) . A redex-algebra {A,isRedex) consists of a S-algebra A 
together with a family {is-Redexfjfgs of functions isRedexf : A^ — > Bool. The language of 
A is the set: 

C{A) = {iiti,...,tn) er(5],0) I isRedexfUhj, ... ,ltnj) = true} 

Let i? be a TRS. A redex-algebra A is called sound for R whenever t € JC{A) implies that t 
is a redex, and A is called complete for R if for all redexes t G T(S, 0) we have t € >C(^). 

Intuitively, a redex-algebra needs to 'remember' only the subterms ii, . . . ,t„ and not 
the term f(ii, . . . , i„) itself. To see this, consider the one-rule system: 

f(g(x)) ^ a 

A tree automaton which can recognize redex positions for this TRS needs at least three 
states: one for indicating a redex f(g(. . .)), one for g(. . .), and one garbage state. For 
redex-algebras two states suffice: one state for g(. . .) and one for garbage, and then we use 
isRedexf{g{. . .)) = true and false, otherwise. 

We now describe a syntactical construction of redex-algebras. The algebras we construct 
are C-models: the C-depth of a rule ^ ^ r is the maximal pattern depth of a left-hand 
side (minus 1), since for recognizing the subterms of left-hand sides we may 'forget' all 
information that lies below the patterns. We first define some auxiliary functions, introduced 
with different notations in |KM9HlHL9T] . 

Definition 7.2. Let S be a signature and X a set of variables. We let _L be a new symbol, 
_L ^ S, and we define T± = T(S U {_L},0). The function cut : T{T,,X) — ;■ 71 is defined 
such that cut{t) is the result of replacing all variables in a term t £ T^S, X) by _L: 

CUt{x) = _L CMt(f(ti, . . . , tn)) = f(cu^(ti), . . . , CUt{tn)) 

We define the function match : 7j_ x 71 — ^ Bool such that match{s,t) = true if s can be 
obtained from t by replacing subterms of t by _L, and we let match{s,t) = false, otherwise. 
Further, we let merge{s,t) be the 'most general common instance' of s and t, that is, 
merge : 71 x 71 ^ 71 is the partial function defined by: 

merge{-L,t) = merge{t, 1) = t 

merge{f{si, ..., s„), f(ti, . . . , t„)) = f{merge{si,ti), ..., merge{sn, tn)) 

Hence merge{s,t) is undefined whenever there exists a position p € Pos{s) such that 
root{s\p) G S, root{t\p) £ S, and root{s\p) / root{t\p). For a term s £ T± and a set 
T C 71 we define the term shrink{s, T) as the largest t (zT (with respect to the number of 
symbols) such that match{t,s) = true. Note that shrink{s,T) is well-defined whenever T 
is closed under merge and _L € T: whenever two terms ti ^ t2 of equal size match s then 
merge{ti,t2) is larger and matches s. 

Definition 7.3 (Construction of redex-algebra). We define a mapping F which constructs 
a redex-algebra for a given TRS R. Let F{R) = {A, isRedex) where A is the smallest set 
such that: 

— ±eA, 

— t (^ A for every proper subterm t of cut{i) with I a left-hand side of a rule in R , 

— merge{s,t) S A whenever s,t (^ A and merge{s,t) is defined. 
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The interpretation function [[•]] of A, and the functions isRedex are defined by: 
m{ti,...,tn) = shrink{f{ti,...,tn),A) 

I true if match{cut{i),i{ti, . . . , t„)) = true for some i ^ r ^ R 



isRedexf{ti,...,tn) , . , 

I false otherwise 



for all function symbols f G S with arity n, and terms ti, . . . , t„. 

The core of the algebra F{R), which we denote by F{R)c, is called the full redex-algehra 
for R. Furthermore, let 5 C i? be the set consisting of all left-linear rules of R. Then F{S)c 
is called the left-linear redex-algehra for R. 

Of course, if i? is a left-linear TRS, then a left-linear redex-algebra for R also is a full 
redex-algebra for R. Moreover, if i? is a quasi- left-linear TRS, then the minimized left-linear 
and full redex-algebras for R are isomorphic. Minimization of redex-algebras is introduced 
in the next section. We now consider two examples which illustrate that F{R)c indeed can 
be a proper subalgebra of F{R). 

Example 7.4. We consider the term rewriting system R which consists of the rules: 

a(x) ^> f(a(x),x) f(x,a(7/)) -)■ b f{x,f{y,z))^b f (x, b) ^^ b 

This TRS is outermost ground terminating, but it is not outermost terminating. We 
construct the redex-algebra F{R) = {A,isRedex) where A = {±, a(±),f(_L, _L), b} with 
[[a](x) = a(±), [[f]](x,y) = f(_L,±) and Jb] = b for all x,y e A. But note that _L is not part 
of the core, and hence the left-linear (full) redex-algebra F{R)c contains only the elements 

{a(±),f(±,±),b}. 

Example 7.5. Consider the term rewriting system R consisting of the rules: 

h(h(h(x)))^a h(h(a))^h(h(h(h(a)))) 

The domain of the redex-algebra F{R) is {h(h(±)), h(_L), _L, h(a), a} with the interpretation 
of the symbols defined by: 

[al = a [hl(a) = h(a) M{h{.)) = h(h(±)) N(h(ha))) = KK^)) 

The values _L and h(±) are not part of the core, and hence the domain of the left-linear (full) 
redex-algebra F{R)c is {h(h(±)), h(a), a} with isRedexh{^{a)) = isRedexh{h{h{±))) = true, 
and false otherwise. 

The next example illustrates the use of the function merge in the construction of a 
redex-algebra. 

Example 7.6. We construct the left-linear (full) redex-algebra for the TRS: 
f(x, y) -^ a(f (c(x), y)) a(f(c(c(x)), y)) -^ e 

f (x, y) ^ b(f (x, c(y))) b(f (x, c(c(y)))) ^ e 

The subterms of cut{i) of linear left-hand sides i are: 

S = {±, f(c(c(±)),±), f(±,c(c(±))), c(c(±)), c(±)} 
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and closure of S under merge yields the algebra ^ = 5 U {f(c(c(_L)),c(c(±)))} with the 
interpretation function defined by: 

|al(x) = [bKx) = M = ^ forallxG^ 

[cKc(±))=c(c(±)) 
Icl(c(c(±)))=c(c(±)) 

[cKx) = c(±) for all x ^ {c(±),c(c(±))} 

IfKc(c(±)),c(c(±)))=f(c(c(±)),c(c(±))) 

Ifl(c(c(±)),x) =f(c(c(±)),X) for all X / c(c(±)) 

[fl(x,c(c(±))) =f(±,c(c(X))) for allx/c(c(±)) 

[[f]](x,y) = J- otherwise 

For the family of isRedex functions we obtain: 

isRedexf{x,y) = true for all x,y G A 

zsi?edeXa(f(c(c(_L)),_L)) = true 

isRedeXa{f{c{c{-L)),c{c{±)))) = true 

isRedexb{f{-i-,c{c{±)))) = true 

isRedexb{f{c{c{±)),c{c{±)))) = true 

and isRedex returns false in all remaining cases. Finally, note that the core of the con- 
structed algebra is the algebra itself, i.e., F{R)c = F{R), because e G S with [e] = _L. 

The following theorem states that left-linear (full) redex-algebras recognize only redex 
positions (at least all redex positions). 

Theorem 7.7. Let R be a TRS over S. The following properties hold: 

(i) The left-linear redex- algebra for R is sound. 
(ii) The full redex-algebra for R is complete. 

(iii) Let A be the left-linear redex-algebra for R. For all t E T(S,0) we have t G ^{A) 
if and only if t is a redex with respect to a left-linear rule in R. Hence, if R is 
quasi-left-linear, then the left-linear redex-algebra for R is sound and complete. 

Proof. We prove (ii) and leave (i) and (iii) to the reader. 

Let {A, isRedex) = F(R)c with F the mapping defined in Definition 17.31 Let t £ 
T(S, 0) be a redex with respect to a rule £ ^- r G R. 

We show match{cut{i\p), |t|p]]) = true for all positions p G Pos{£) \{e} by induction on 
the structure of £. If £\p is a variable then cut{£\p) = ±, and we have match{J-,a) = true 
for all a G A. If £\p = f(si, . . . , s„), then t\p = f (ti, . . . , t„) and by induction hypothesis we 
have match{cut{si),ltil) = true. Hence maic/i(cui(f(si, . . . , s„)),f([[ti], . . . , [[t„]])) = true 
by definition of cut. Moreover, match(a,t) = true implies match (a, shrink (t, A)) = true 
for all a & A. By definition we have [[f(ti, . . . , tn)l = shrink {f{lti}, . . . , [[t„]]), ^), and hence 
match{cut{f{si, . . . , s„)), |f(ti, . . . ,t„)]]) = true. 

Let t = g(ni, . . . ,Um) and £ = g{wi, . . . ,Wm)- Then we know match{cut{wi), |{nj) = 
true. Hence match{cut{£) , gduij, . . . , lumj)) = true and t G C{A). D 
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8. Minimizing Algebras 
In this section we are concerned with the minimization of redex-algebras. The algorithm 



is similar to the minimization of deterministic tree automata, see CDG"'"07 . For the set 



of 291 TRSs of the outermost termination competition of 2008 |Ter08] . the redex-algebras 
constructed according to Definition 17.31 have an average size of 4.6 elements. After an 
application of the minimization algorithm described here, the average size falls to 3.4, a 
reduction of 27%. This reduction has a polynomial influence on the number of rules of the 
transformed system. 

Definition 8.1. Two core redex-algebras Ai, A2 are called equivalent if £(^1) = L(^A2). 

Lemma 8.2. Let A\, A2 he equivalent, core redex-algebras. Then Ai is sound or complete 
if and only if A2 has the respective property. □ 

For a given core redex-algebra we now construct a minimal equivalent algebra. The 
difference to the minimization of tree automata from CDG"'"07| lies in the initial equiv- 



alence Eq. For tree automata this initial equivalence consists of two partitions, the final 
and the non-final states. In our setting two states are initially equivalent if they cannot 
be distinguished using the isRedex functions, that is, isRedexf{x,a,y) = isRedexf{x ,b,y) 
for each symbol f € S and each assignment of x and y. This can yield any number of 
partitions between 1 and |^|. 

Definition 8.3 (Minimization of redex-algebra) . Let {A,isRedex) be a core redex-algebra 
over E. We define equivalence relations Ei for i € N on the elements of A. Initially two 
elements a,b £ A are equivalent, a Eq b, if: 

isRedexf{x,a,y) = isRedexf{x,b,y) 

for all symbols f G S„, j G {1, . . . , n}, x € A^^"^, and y € A^~^ . Then for i = 0, 1, . . . and 
a, 6 G >l we define a -Ej+i b \i a Eib holds and: 

m{x,a,y)E,iq{x,b,y) 

for all n-ary symbols f G S, j G {1, . . . ,n}, x G A^~^, and y G ^"^•'. The process halts 
when E'j+i = Ei for some i G N, and then we define E = E^. Let [a] denote the equivalence 
class of a G ^ with respect to E. The minimized redex-algebra of A, denoted A"^^'^, is 
defined as ^™" = {E, {-j^, isRedex^) where: 

[ff([ai],...,M) = [f(ai,...,a„)] 

isRedeXf ([ai], . . . , [a„]) = isRedexf{ai, . . . , a„) 

for each symbol f G S with arity n. 

Lemma 8.4. Let A be a core redex-algebra, then A is equivalent to A™'^^ . □ 

Example 8.5. We consider the TRS R consisting of the following three rules: 

f(i(a))^a f(j(a))^a f(a) ^ a 

The left-linear (full) redex-algebra for R is A = {a, i(a),j(a), ±} with the interpretation 
|a] = a, l[i]](3) = 1(3)1 01(3) = J(3)' ^'^d the interpretation is _L in all non-listed cases; 
isRedexf{x) = true for all x 7^ _L, and false, otherwise. 

The minimization algorithm starts with Eq = {{a, i(a),j(a)}, {_L}} as initial equiva- 
lence, since _L can be distinguished from the other elements {isRedexf{l-) = false). The 
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first iteration of the algoritlim yields Ei = {{a}, {i(a),j(a)}, {_L}} as l^\}{a) = i(a) wliereas 
|i](i(a)) = |Ii](j(a)) = ±. The elements i(a) and j(a) are indistinguishable, and so in the 
second iteration we obtain E2 = Ei. Thus the elements i(a) and j(a) are identified and we 
obtain an algebra that has one element less than the algebra we started with. 



9. Constructing Minimal and Maximal C-labelings 

In the previous sections we have constructed and minimized redex-algebras for recog- 
nizing redex positions. For completing the transformation we still have to explain how to 
construct C-labelings from the redex-algebras. 

In minimal labeling symbols are marked with a * if they correspond to redex positions 
and stay unlabeled otherwise. This labeling creates a small signature and thereby results 
in a small number of rules of the transformed system. 

Definition 9.1. Let i? be a TRS over S, and A a redex-algebra. The minimal labeling with 
respect to A is the C-labeling (^, vr, S''^'^) defined for each n-ary symbol f G Stop by: 

N I* a isRedexfiai, . . . ,an) = true 

TTf{ai,...,an) = < 

I e otherwise 

The set of redex symbols is defined by S = {f* | f € S}. 

Theorem 9.2. Let R be a TRS, and A a sound redex-algebra for R. The minimal labeling 
with respect to A is a sound C-labeling for R. 

Proof. Let t = f (ti, . . . , t„) G T{T., 0) such that root{lab{t)) € T/'""^ . Then by definition of 
minimal labeling we have isRedexf{\ti\., . . . , {tn}) = true. Hence t G C{A) and thus t is a 
redex by definition of sound redex-algebra. □ 

The construction and minimization of redex-algebras (Definitions 17.31 and 18. 3p give rise 
to sound minimal C-labelings (Theorem 17.71 Lemmas 18.41 and 18.21 and Theorem 19. 2p . In 
combination with Theorems 15.81 and 16.101 this provides us with sound transformations for 
proving outermost termination: 

Corollary 9.3. Let R be a TRS, and let {A, it , 'E'^^'^) be the minimal labeling with respect 
to the minimized left-linear redex-algebra for R. Then R is outermost ground terminating 
whenever the dynamic context extension A^R or the dynamic labeling 'I'^R is terminating. 

Minimal labeling is sound and efficient, but it is not complete (not even for left-linear 
TRSs where the left-linear redex-algebra is complete): 

Example 9.4. Let R be the term rewriting system consisting of the rules: 

inf(a;) -^ cons(a;, inf(s(x))) cons(s(x),y) -^ nil 

Obviously, R is outermost terminating. The minimized left-linear redex-algebra for R is: 

A = {s, ±} H(x) = s [nill = [infl(x) = [consKx,y) = ± 

for all x,y G A. The C-depth of both rules (with respect to A) is 0. Using minimal 
labeling we obtain 7rcons(s, x) = -k and VTjnf (x) = -k for all x G A, and other symbols are left 
unmarked (e). Thus the set of redex symbols is S = {inf*,cons*}. 
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The dynamic context extension A'^i? of R with respect to the C-labehng (^, 7r,S ) 
consists of the following rules, the first two of which arise from the inf-rule, with the values 
_L and s assigned to x respectively: 

inf*(x) -> cons(x,inf*(s(x))) 

inf*(x) -^ cons*(x,inf*(s(x))) 

cons*(s(x), y) — )• nil 

The replacement map is defined by /x(inf*) = /^(cons*) = 0. 
Now l^R admits an infinite derivation: 

inf*(j;) -^A^R,n cons(x, inf*(s(x))) -^a^r,h cons(x, cons(s(x), inf*(s(s(x))))) -^ . . . 

The third term is labeled incorrectly, as the inner occurrence of cons should be marked. 
The reason is that in the second step, instead of the first inf*-rule, the second should have 
been applied; however, the left-hand side inf*(x) contains too little information to 'decide' 
what the labeling of the right-hand side should be. 

This motivates the use of maximal labeling for which correct labeling is preserved under 
rewriting. Function symbols are labeled with the interpretation of their arguments: 

Definition 9.5. Let i? be a TRS over S, and let ^ be a redex-algebra for R. The maximal 
labeling with respect to A is the C-labeling {A, vr, S'"'^'') defined for each n-ary f G Stop by: 

7rf(ai,. . . ,a„) = (ai,... ,a„) 

The set of redex symbols is defined by: S = |f('^iv.an) | isRedexf{ai, . . . ,an) = true}. 

Theorem 9.6. Let R be a TRS, and let A be a redex-algebra for R. The maximal labeling 
with respect to A is a maximal C -labeling for R, and it is sound, complete, and core whenever 
A has the respective property. 

Proof. Maximality of the maximal labeling is immediate from the definition. Let ^ be a 
complete redex-algebra. Let t = f (ti, . . . , t„) G T(S, 0) be a redex. Then by definition of 
complete redex-algebra zsiiedexfdti], . . . , [[t„]]) = true, and it follows that root{lab{t)) = 
f{[*i]>---,[tnl> ^ S'"'^'^. Hence the C-labeling is complete. Analogous to the proof of Theo- 
rem [921 we obtain that maximal labeling is sound whenever the redex-algebra A is sound. 
Note that the remaining claim concerning coreness is immediate by definition. □ 

The construction and minimization of redex-algebras (Definitions 17.31 and 18. 3p give 
rise to sound and complete maximal C-labelings (Theorem 17.71 Lemmas 18.41 and 18. 2^ and 
Theorem l9.6p . In combination with Theorems l5.8|[5TT3] and [6.10l this provides us with sound 
transformations for proving outermost termination for arbitrary TRSs. For quasi- left-linear 
TRSs dynamic context extension is both sound and complete. 

Corollary 9.7. Let R be a TRS, and let {A,it,T,'^^'^) be the maximal labeling for the min- 
imized left-linear redex-algebra for R. Then R is outermost ground terminating whenever 
the dynamic context extension /^R or the dynamic labeling f^i? is terminating. Moreover, 
if R is quasi-left-linear, then R is outermost ground terminating if and only if the dynamic 
context extension A^R terminates. 

As a consequence, the full redex-algebra for an arbitrary TRS can be used to disprove 
outermost ground termination: 



TRANSFORMING OUTERMOST INTO CONTEXT-SENSITIVE REWRITING * 



35 



Corollary 9.8. Let R be a TRS, and let (^, 7r,S ) be the maximal labeling for the min- 
imized full redex-algebra for R. Then the dynamic context extension N' R is terminating 
whenever R is outermost ground terminating. 

Example 9.9. We revisit Example l9.4l but this time we give the dynamic context extension 
with respect to maximal labeUng: 

inf^(x) -^ cons^'^(a;,inP(s^(a;))) inP(x) -^ cons='^(a;,inP(s'(a;))) 

cons^'-'-(s"'"(x),y) —^ nil cons^'"'"(s^(x),y) —^ nil 

cons^'^(s^(x),y) —^ nil cons^'^(s^(x),y) —^ nil 

with ^(inf^) = /i(inP) = /^(cons^'-'-) = ^(cons^'^) = 0. This context-sensitive TRS is 
terminating as opposed to the one constructed in Example 19.41 To prove termination we 
give a strictly decreasing polynomial interpretation over the natural numbers: 

lnil]=0 lcons^'^}{x,y) = x + y [inf^](x) = x + 3 ls^}{x) = x 

|cons^'^](x,2/) = 1 |inf ](x) = 2 |sl(x) = x 



10. Evaluation 

With the implementation of the transformation by dynamic context extension, de- 
scribed in Section [5l the termination prover Jambox |End09| gained first place in the cat- 
egory of outermost rewriting of the termination competition of 2008 |Ter08| , see Table [TJ 
With an average time of 4.1 seconds per termination proof, Jambox was also faster than the 





score 


average time 


Jambox [End09l 


72 (93.5%) 


4.1s 


TrafO [RZ09J 


46 (59.7%) 


8.1s 


AProVE [GSKT06I 


27 (35.0%) 


10.8s 



Table 1: Results of proving outermost termination in the competition of 2008 [Ter08| . 



other participants, providing empirical evidence for the efficiency of the transformation of 
dynamic context extension. Not listed in Tabled] is TTT2 [KSZM09] . which did not prove 
outermost termination, but performed best in disproving outermost termination. 

The percentages listed in Table [1] are relative to the total number of term rewriting 
systems which were proven to be outermost terminating by some participating tool. The 
TPDB 2008 contained 291 TRSs in the category of outermost rewriting of which 77 were 
proven outermost terminating, 161 not outermost terminating, and 53 remained unsolved 
in the competition. We note that around 50 systems in the database are, strictly speaking, 
not term rewriting systems, as they contain variables in the right-hand sides that do not 
occur in the left-hand sides. 

In the termination competition 2008, Jambox used exclusively the approach of dynamic 
context extension (Section [5]). If we additionally use dynamic labeling, as defined in Sec- 
tion [6l the score of Jambox improves by 4, thus proving 76 systems to be outermost termi- 
nating. 
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The secret behind the efficiency of Jambox is threefold: First, we construct and minimize 
the algebras employed for marking redex positions, see Sections [7] and [H Secondly, we try 
two labeling strategies: minimal and maximal, see Section [9l Minimal labeling is very 
efficient and contributes to 75% of the success of Jambox. In order to have a complete 
transformation we also employ maximal labeling. Thirdly, dynamic context extension is 
the combination of labeling and context extension, where the prefixing of contexts to rules 
depends on the interpretation of the variables. All these optimizations minimize the number 
of rules and their size in the transformed systems, which is important to keep a manageable 
search space. 



maximum 
upper quartile 
median 
lower quartile 
minimum 



Next, we compare the performance of dynamic context extension and dynamic labeling 
(Section [6|). Figure [1] shows the size of the transformed systems in relation to the size 
of the input system, as measured on the TPDB |iTer08j . For each input size we display 
the minimum, the lower quartile (25th percentile), the median, the upper quartile (75th 
percentile), and the maximum size of the transformed systems. From Figure [1] it can be 
inferred that for larger input systems the dynamic labeling usually is a factor 5 or 10 smaller 
than the dynamic context extension. For systems with more than 10 rules there are only 
a few examples available in the database, which explains why some of the quartiles fall 
together. 




10 12 14 16 18 



2 4 6 8 10 12 14 16 18 



Figure 1: Size of the transformed systems (y-axis) in relation to the size of the input TRS 
(2;-axis) using dynamic context extension (left), and dynamic labeling (right), 
both with maximal C-labelings. 



Table [2] shows a comparison of our different methods (dynamic context extension and 
dynamic labeling, combined with minimal or maximal labeling). Each row lists the total 
score of one method with the number of systems it can solve that cannot be solved by the 
method corresponding to the column. For example, the value 3 in Table [2] means that three 
systems can be solved by dynamic maximal labeling, but not by dynamic context extension 
in combination with maximal labeling. The table shows that dynamic context extension 
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Method 


Total Score 


^A'^i?, max 


^t''^, max 


^A'"/?, min 


^t''^, min 


l^R, max 


71 





5 


16 


25 


YR, max 


69 


3 





16 


21 


^R, min 


57 


2 


4 





9 


f^i?, min 


50 


4 


2 


2 






Table 2: Comparison of the proposed methods on the TPDB 2008. 

and dynamic labeling are roughly equal in strength, and that maximal labeling gives the 
best results. 

Table [3] illustrates that the C-depth of rules is typically small (employing left-linear 
redex-algebras): it is or 1 in 94.5% of the cases. Note that 54% of the rules have C-depth 0, 
but this does not mean that the same percentage of the TRSs could be handled by a model 
(Definition [32]). Only 14% of the TRSs have C-depth 0. 



C-depth 





1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


#rules 


54% 


40.5% 


3.3% 


1.3% 


0.5% 


0.2% 


0% 


0% 


0% 


0.1% 


0% 



Table 3: Ratio of rules having a certain C-depth in the TPDB 2008. 



11. Discussion 

For arbitrary TRSs the transformation based on dynamic context extension (including 
the construction of C-labelings) is sound, and for quasi-left-linear TRSs it is sound and 
complete. The sound redex-algebra we construct recognizes redexes with respect to left- 
linear rules. As a consequence, in the /iTRS N^R rewriting is forbidden only inside such 
redex positions. This corresponds to a weakening of the outermost rewriting strategy: 
contraction of a redex is disallowed only if it is contained within a redex with respect to 
left-linear rule. Let us call this the 'left-linear outermost' rewriting strategy. Dynamic 
context extension combined with maximal labeling is sound and complete for termination 
with respect to this rewriting strategy for all TRSs. 

In a similar way the transformation of [RZ09] can be generalized from quasi-left-linear 
TRSs to arbitrary TRSs. For soundness the anti-matching rules do not need to exactly 
match the non-redex terms, as long as at least all non-redex terms are matched. Then the 
symbol down can be moved inside redexes with respect to rules which are not left-linear. 
This enables only additional rewrite steps but does not harm soundness. More precisely, 
using this generalization the transformation of |RZ09| becomes complete with respect to left- 
linear outermost termination. Thereby the score of TrafO in the termination competition of 
2008 |Ter08] could possibly have been improved by around 20%, resulting in a score of 57 
instead of 47. 

We have shown that the transformation of dynamic labeling is complete on the set of 
correctly labeled terms /a6(T(S,0)) without the auxiliary relabel symbols (Theorem 16. 12p . 
The non-completeness with respect to termination on all terms arises from 'illegally placed' 
relabel symbols in combination with duplicating rules, see Example 16.131 The duplicating 
rules can multiply the illegal symbols and make them reusable over and over again. To 
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prevent this, one can introduce an extra symbol block with ;u(block) = for disaUowing 
relabel symbols beneath the rule application. For this purpose, we wrap each duplicated 
variable in the right-hand side of a labeled rule into a context block(n), and we extend the 
dynamic labeling with rules of the form block(f(xi, . . . ,Xn)) — )■ f(block(xi), . . . , block(rE„)) 
for each symbol f G lab{T,n) (excluding relabel symbols!). Note that this implies that 
block symbols disappear when meeting a constant. For instance, reconsider the TRS from 
Example 16.131 which had among others the following rule in its dynamic labeling: 



-L^f-l, 



f'"^'^(c,y)^h-^(f^'^(y,y)) 



This rule would be modified to: 

f^^'^(c,y) ^ h^(f^'^(block(y),block(y))) 

In this way we 'block' each duplicated variable. 

Another question is whether there are interesting labelings between minimal and max- 
imal. In particular, are there more efficient complete labelings? Here efficiency is measured 
in the size of the signature and the number of rules of the transformed system. In Ex- 
ample 19.91 it would have been sufficient to label cons with the interpretation of the left 
argument, saving two symbols and two rules of the transformed system. 
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